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COMPOSITIONS, KITS, AND METHODS FOR 
IDENTIFICATION, ASSESSMENT, PREVENTION, AND THERAPY OF 
BREAST AND OVARIAN CANCER 

5 RELATED APPLICATIONS 

The present application claims priority from U.S. provisional patent application 
serial no. 60/300,159, filed on June 21, 2001, which was abandoned on June 25, 2001, 
and from U.S. provisional patent application serial no. 60/301,351, filed on June 27, 
2001. All of the above applications are expressly incorporated by reference. 

10 

FIELD OF THE INVENTION 

The field of the invention is cancer, particularly breast and ovarian 
cancers, including diagnosis, characterization, management, and therapy of breast and 
ovarian cancers. 

15 

BACKGROUND OF THE INVENTION 

The increased number of cancer cases reported in the United States, and, 
indeed, around the world, is a major concern. Currently there are only a handful of 
treatments available for specific types of cancer, and these provide no absolute guarantee 

20 of success. In order to be most effective, these treatments require not only an early 

detection of the malignancy, but a reliable assessment of the severity of the malignancy. 

The incidence of breast cancer, a leading cause of death in women, has 
been gradually increasing in the United States over the last thirty years. In 1997, it was 
estimated that 181,000 new cases were reported in the U.S., and that 44,000 people 

25 would die of breast cancer (Parker et al 1 997, CA Cancer 1 Clin. 47:5-27; Chu et al 
1996, J. Nat, Cancer Inst. 88:1571-1579). While the pathogenesis of breast cancer is 
unclear, transformation of normal breast epithelium to a malignant phenotype may be 
the result of genetic factors, especially in women imder 30 (Miki et al, 1994, Science, 
266:66-71). The discovery and characterization of BRCAl and BRCA2 has recently 

30 expanded our knowledge of genetic factors which can contribute to familial breast 
cancer. Germ-line mutations within these two loci are associated with a 50 to 85% 
lifetime risk of breast and/or ovarian cancer (Casey, 1997, Curr. Opin Oncol 9:88-93; 
Marcus et al, 1996, Cancer 77:697-709). However, it is likely that other, non-genetic 
factors also have a significant effect on the etiology of the disease. Regardless of its 

35 origin, breast cancer morbidity and mortality increases significantly if it is not detected 
early in its progression. Thus, considerable effort has focused on the early detection of 
cellular transformation and tumor formation in breast tissue. 
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Currently, the principal manner of identifying breast cancer is through 
detection of the presence of dense tumorous tissue. This may be accomplished to 
varying degrees of effectiveness by direct examination of the outside of the breast, or 
through mammography or other X-ray imaging methods (Jatoi, 1999, Am. J. Surg, 
5 1 77:5 1 8-524). The latter approach is not without considerable cost, however. Every time 
a mammogram is taken, the patient incurs a small risk of having a breast tumor induced 
by the ionizing properties of the radiation used during the test. In addition, the process is 
expensive and the subjective interpretations of a technician can lead to imprecision, e.g., 
one study showed major clinical disagreements for about one-third of a set of 

10 mammograms that were interpreted individually by a surveyed group of radiologists. 
Moreover, many women find that undergoing a mammogram is a painful experience. 
Accordingly, the National Cancer Institute has not recommended mammograms for 
women under fifty years of age, since this group is not as likely to develop breast 
cancers as are older women. It is compelling to note, however, that while only about 

15 22% of breast cancers occur in women under fifty, data suggests that breast cancer is 
more aggressive in pre-menopausal women. 

Ovarian cancer is also responsible for significant morbidity and mortality 
in populations around the world. Ovarian cancer is classified, on the basis of clinical 
and pathological features, in three groups, namely epithelial ovarian cancer (EOC; >90% 

20 of ovarian cancer in Westem countries), germ cell tumors (circa 2-3% of ovarian 
cancer), and stromal ovarian cancer (circa 5% of ovarian cancer; Ozols et al, 1997, 
Cancer Principles and Practice of Oncology, 5th ed., DeVita et ah, Eds. pp. 1502). 
Relative to EOC, germ cell timiors and stromal ovarian cancers are more easily detected 
and treated at an early stage, translating into higher/better survival rates for patients 

25 afflicted with these two types of ovarian cancer. 

There are numerous types of ovarian tumors, some of which are benign, 
and others of which are malignant. Treatment (including non-treatment) options and 
predictions of patient outcome depend ori accurate classification of the ovarian cancer. 
Ovarian cancers are named according to the type of cells from which the cancer is 

30 derived and whether the ovarian cancer is benign or malignant. Recognized histological 
tumor types include, for example, serous, mucinous, endometrioid, and clear cell tumors. 
In addition, ovarian cancers are classified according to recognized grade and stage 
scales. 

In grade I, the tumor tissue is well differentiated from normal ovarian 
35 tissue. In grade II, tumor tissue is moderately well differentiated. In grade III, the tumor 
tissue is poorly differentiated from normal tissue, and this grade correlates with a less 
favorable prognosis than grades I and II. Stage I is generally confined within the 
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capsule surrounding one (stage lA) or both (stage IB) ovaries, although in some stage I 
(i.e. stage IC) cancers, malignant cells may be detected in ascites, in peritoneal rinse 
fluid, or on the surface of the ovaries. Stage II involves extension or metastasis of the 
tumor from one or both ovaries to other pelvic structures. In stage IIA, the tumor 
S extends or has metastasized to the uterus, the fallopian tubes, or both. Stage IIB 
involves extension of the tumor to the pelvis. Stage IIC is stage IIA or IIB in which 
malignant cells may be detected in ascites, in peritoneal rinse fluid, or on the surface of 
the ovaries. In stage III, the tumor comprises at least one malignant extension to the 
small bowel or the omentum, has formed extrapelvic peritoneal implants of microscopic 

10 (stage III A) or macroscopic (< 2 centimeter diameter, stage IIIB; > 2 centimeter 
diameter, stage IIIC) size, or has metastasized to a retroperitoneal or inguinal lymph 
node (an alternate indicator of stage IIIC). In stage IV, distant (i.e. non-peritoneal) 
metastases of the tumor can be detected. 

The durations of the various stages of ovarian cancer are not presently 

1 5 known, but are believed to be at least about a year each (Richart et al , 1 969, Am. J, 
Obstet. Gynecol 105:386). Prognosis declines with increasing stage designation. For 
example, 5-year survival rates for patients diagnosed with stage I, II, III, and IV ovarian 
cancer are 80%, 57%, 25%, and 8%, respectively. 

Despite being the third most prevalent gynecological cancer, ovarian 

20 cancer is the leading cause of death among those afflicted with gynecological cancers. 
The disproportionate mortality of ovarian cancer is attributable to a substantial absence 
of symptoms among those afflicted with early-stage ovarian cancer and to difficulty 
diagnosing ovarian cancer at an early stage. Patients afflicted with ovarian cancer most 
often present with non-specific complaints, such as abnormal vaginal bleeding, 

25 gastrointestinal symptoms, urinary tract symptoms, lower abdominal pain, and 

generalized abdominal distension. These patients rarely present with paraneoplastic 
symptoms or with symptoms which clearly indicate their affliction. Presently, less than 
about 40% of patients afflicted with ovarian cancer present with stage I or stage 11. 
Management of ovarian cancer would be significantly enhanced if the disease could be 

30 detected at an eariier stage, when treatments are much more generally efficacious. 

Ovarian cancer may be diagnosed, in part, by collecting a routine medical 
history from a patient and by performing physical examination, x-ray examination, and 
chemical and hematological studies on the patient. Hematological tests which may be 
indicative of ovarian cancer in a patient include analyses of serum levels of proteins 

35 designated CA125 and DF3 and plasma levels of lysophosphatidic acid (LP A), 

Palpation of the ovaries and ultrasound techniques (particularly including endovaginal 
ultrasound and color Doppler flow ultrasound techniques) can aid detection of ovarian 
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tumors and differentiation of ovarian cancer from benign ovarian cysts. However, a 
definitive diagnosis of ovarian cancer typically requires performing exploratory 
laparotomy of the patient. 

Potential tests for the detection of ovarian cancer (e.g., screening, reflex 
5 or monitoring) may be characterized by a number of factors. The "sensitivity" of an 
assay refers to the probability that the test will yield a positive result in an individual 
afflicted with ovarian cancer. The "specificity" of an assay refers to the probability that 
the test will yield a negative result in an individual not afflicted with ovarian cancer. 
The "positive predictive value" (PPV) of an assay is the ratio of true positive results (Le. 

10 positive assay results for patients afflicted with ovarian cancer) to all positive results (Le. 
positive assay results for patients afflicted with ovarian cancer + positive assay results 
for patients not afflicted with ovarian cancer). It has been estimated that in order for an 
assay to be an appropriate population-wide screening tool for ovarian cancer the assay 
must have a PPV of at least about 10% (Rosenthal et al, 1998, Sem, Oncol 25:315- 

15 325). It would thus be desirable for a screening assay for detecting ovarian cancer in 
patients to have a high sensitivity and a high PPV. Monitoring and reflex tests would 
also require appropriate specifications. 

Owing to the cost, limited sensitivity, and limited specificity of known 
methods of detecting ovarian cancer, screening is not presently performed for the 

20 general population. In addition, the need to perform laparotomy in order to diagnose 
ovarian cancer in patients who screen positive for indications of ovarian cancer limits 
the desirability of population-wide screening, such that a PPV even greater than 10% 
would be desirable. 

Prior use of serum CA125 level as a diagnostic marker for ovarian cancer 

25 indicated that this method exhibited insufficient specificity for use as a general screening 
method. Use of a refined algorithm for interpreting CA125 levels in serial retrospective 
samples obtained from patients improved the specificity of the method without shifting 
detection of ovarian cancer to an earlier stage (Skakes, 1995, Cancer 76:2004). 
Screening for LPA to detect gynecological cancers including ovarian cancer exhibited a 

30 sensitivity of about 96% and a specificity of about 89%. However, CA125-based 

screening methods and LPA-based screening methods are hampered by the presence of 
CA125 and LPA, respectively, in the serum of patients afflicted with conditions other 
than ovarian cancer. For example, serum CA125 levels are known to be associated with 
menstruation, pregnancy, gastrointestinal and hepatic conditions such as colitis and 

35 cirrhosis, pericarditis, renal disease, and various non-ovarian malignancies. Serum LPA 
is known, for example, to be affected by the presence of non-ovarian gynecological 
malignancies. A screening method having a greater specificity for ovarian cancer than 
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the current screening methods for CA12S and LP A could provide a population- wide 
screening for early stage ovarian cancer. 

Presently greater than about 60% of ovarian cancers diagnosed in patients 
are stage III or stage IV cancers. Treatment at these stages is largely limited to 
5 cytoreductive surgery (when feasible) and chemotherapy, both of which aim to slow the 
spread and development of metastasized tumor. Substantially all late stage ovarian 
cancer patients currently undergo combination chemotherapy as primary treatment, 
usually a combination of a platinum compound and a taxane. Median survival for 
responding patients is about one year. Combination chemotherapy involving agents 

10 such as doxorubicin, cyclophosphamide, cisplatin, hexamethylmelamine, paclitaxel, and 
methotrexate may improve survival rates in these groups, relative to single-agent 
therapies. Various recently-developed chemotherapeutic agents and treatment regimens 
have also demonstrated usefulness for treatment of advanced ovarian cancer. For 
example, use of the topoisomerase I inhibitor topectan, use of amifostine to minimize 

1 5 chemotherapeutic side effects, and use of intraperitoneal chemotherapy for patients 
having peritoneally implanted tumors have demonstrated at least limited utility. 
Presently, however, the 5-year survival rate for patients afflicted with stage III ovarian 
cancer is 25%, and the survival rate for patients afflicted with stage IV ovarian cancer is 
8%. 

20 It would therefore be beneficial to provide specific methods and reagents 

for the diagnosis, staging, prognosis, monitoring, and treatment of diseases associated 
with breast and/or ovarian cancer, or to indicate a predisposition to such for preventative 
measures. The present invention is directed towards these needs. 

25 SUMMARY OF THE INVENTION 

The invention relates to breast and/or ovarian cancer markers (hereinafter 
"markers'* or "markers of the invention"), which are listed in Tables 1-5. The invention 
provides nucleic acids and proteins that are encoded by or correspond to the markers 
(hereinafter "marker nucleic acids" and "marker proteins," respectively). Table 1 

30 provides the sequence identifiers of the sequences of such marker nucleic acids and 
proteins listed in the accompanying Sequence Listing. The invention further provides 
antibodies, antibody derivatives and antibody fragments which bind specifically with 
such proteins and/or fragments of the proteins. 

The invention also relates to various methods, reagents and kits for 

35 diagnosing, staging, prognosing, monitoring and treating cancers, particularly breast and 
ovarian cancers. "Breast cancer" and "ovarian cancer" as used herein include 
carcinomas, (e.g., carcinoma in situ, invasive carcinoma, metastatic carcinoma) and pre- 
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malignant conditions. In one embodiment, the invention provides a diagnostic method of 
assessing whether a patient has breast or ovarian cancer or has higher than normal risk 
for developing breast or ovarian cancer, comprising the steps of comparing the level of 
expression of a marker of the invention in a patient sample and the normal level of 
5 expression of the marker in a control, e,g., a sample from a patient without breast or 
ovarian cancer. A significantly higher level of expression of the marker in the patient 
sample as compared to the normal level is an indication that the patient is afflicted with 
breast or ovarian cancer or has higher than normal risk for developing breast or ovarian 
cancer. 

10 According to the invention, the markers are selected such that the positive 

predictive value of the methods of the invention is at least about 10%, preferably about 
25%, more preferably about 50% and most preferably about 90%. Also preferred for 
use in the methods of the invention are markers that are differentially expressed, as 
compared to normal breast cells, by at least two-fold in at least about 20%, more 

15 preferably about 50% and most preferably about 75% of any of the following conditions: 
stage 0 breast cancer patients, stage I breast cancer patients, stage IIA breast cancer 
patients, stage IIB breast cancer patients, stage IIIA breast cancer patients, stage IIIB 
breast cancer patients, stage IV breast cancer patients, grade I breast cancer patients, 
grade II breast cancer patients, grade III breast cancer patients, malignant breast cancer 

20 patients, ductal carcinoma breast cancer patients, and lobular carcinoma breast cancer 
patients. Further preferred for use in the methods of the invention are markers that are 
differentially expressed, as compared to normal ovarian cells, by at least two-fold in at 
least about 20%, more preferably about 50%, and most preferably about 75% of any of 
the following conditions: stage I ovarian cancer patients, stage II ovarian cancer 

25 patients, stage III ovarian cancer patients, stage IV ovarian cancer patients, grade I 
ovarian cancer patients, grade II ovarian cancer patients, grade III ovarian cancer 
patients, epithelial ovarian cancer patients, stromal ovarian cancer patients, germ cell 
ovarian cancer patients, malignant ovarian cancer patients, benign ovarian cancer 
patients, serous neoplasm ovarian cancer patients, mucinous neoplasm ovarian cancer 

30 patients, endometrioid neoplasm ovarian cancer patients and/or clear cell neoplasm 
ovarian cancer patients. 

In a preferred diagnostic method of assessing whether a patient is 
afflicted with breast or ovarian cancer {e.g., new detection ("screening"), detection of 
recurrence, reflex testing), the method comprises comparing: 

35 
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a) the level of expression of a marker of the invention in a patient sample, 

and 

b) the normal level of expression of the marker in a control non- 
cancerous breast or non-cancerous ovarian cancer sample. 

5 A significantly higher level of expression of the marker in the patient sample as 

compared to the normal level is an indication that the patient is afflicted with breast or 
ovarian cancer. In a preferred diagnostic method for breast cancer, the marker is 
selected from the markers in Table 2. In a preferred diagnostic method for ovarian 
cancer, the marker is selected from the markers in Table 3. 
10 The invention also provides methods for assessing the efficacy of a 

therapy for inhibiting breast or ovarian cancer in a patient. Such methods comprise 
comparing: 

a) expression of a marker of the invention in a first sample obtained from 
the patient prior to providing at least a portion of the therapy to the patient, and 

1 5 b) expression of the marker in a second sample obtained from the patient 

following provision of the portion of the therapy. 
A significantly lower level of expression of the marker in the second sample relative to 
that in the first sample is an indication that the therapy is efficacious for inhibiting breast 
or ovarian cancer in the patient. In a preferred method for breast cancer, the marker is 
20 selected from the markers in Table 2. In a preferred method for ovarian cancer, the 
marker is selected from the markers in Table 3. 

It will be appreciated that in these methods the "therapy" may be any 
therapy for treating breast or ovarian cancer including, but not limited to, chemotherapy, 
radiation therapy, surgical removal of tumor tissue, gene therapy and biologic therapy 
25 such as the administering of antibodies and chemokines. Thus, the methods of the 
invention may be used to evaluate a patient before, during and after therapy, for 
example, to evaluate the reduction in tumor burden. 

In a preferred embodiment, the methods are directed to therapy using a 
chemical or biologic agent. These methods comprise comparing: 
30 a) expression of a marker of the invention in a first sample obtained fi^om 

the patient and maintained in the presence of the chemical or biologic agent, and 

b) expression of the marker in a second sample obtained from the patient 
and maintained in the absence of the agent. 

A significantly lower level of expression of the marker in the second sample relative to 
35 that in the first sample is an indication that the agent is efficacious for inhibiting breast 
or ovarian cancer, in the patient. In one embodiment, the first and second samples can 
be portions of a single sample obtained from the patient or portions of pooled samples 
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obtained from the patient. In a preferred embodiment, the methods are directed to 
therapy for treating breast cancer and the marker is selected from the markers in Table 2. 
In another preferred embodiment, the methods are directed to therapy for treating 
ovarian cancer and the marker is selected from the markers in Table 3. 
5 The invention additionally provides a monitoring method for assessing 

the progression of breast or ovarian cancer in a patient, the method comprising: 

a) detecting in a patient sample at a first time point, the expression of a 
marker of the invention; 

b) repeating step a) at a subsequent time point in time: and 

10 c) comparing the level of expression detected in steps a) and b), and 

therefrom monitoring the progression of breast or ovarian cancer in the patient. 
A significantly higher level of expression of the marker in the sample at the subsequent 
time point from that of the sample at the first time point is an indication that the breast or 
ovarian cancer has progressed, whereas a significantly lower level of expression is an 

1 5 indication that the breast or ovarian cancer has regressed. In a preferred embodiment for 
breast cancer, the marker is selected from the markers in Table 2. In a preferred 
embodiment for ovarian cancer, the marker is selected from the markers in Table 3. 

The invention fiirther provides a diagnostic method for determining 
whether breast or ovarian cancer has metastasized or is likely to metastasize, the method 

20 comprising comparing: 

a) the level of expression of a marker of the invention in a patient sample, 

and 

b) the normal level (or non-metastatic level) of expression of the marker 
in a control sample. 

25 A significantly higher level of expression in the patient sample as compared to the 
normal level (or non-metastatic level) is an indication that the breast or ovarian cancer 
has metastasized or is likely to metastasize. In a preferred diagnostic method for breast 
cancer, the marker is selected from the markers in Table 2. In a preferred diagnostic 
method for ovarian cancer, the marker is selected from the markers in Table 3. 

30 The invention moreover provides a test method for selecting a 

composition for inhibiting breast or ovarian cancer in a patient. This method comprises 
the steps of: 

a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a 
35 plurality of test compositions; 

c) comparing expression of a marker of the invention in each of the 
aliquots; and 
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d) selecting one of the test compositions which significantly reduces the 
level of expression of the marker in the aliquot containing that test composition, 
relative to the levels of expression of the marker in the presence of the other test 
compositions. 

5 In a preferred method for selecting a composition for inhibiting breast 

cancer, the marker is selected from the markers in Table 2. In a preferred method for 
selecting a composition for inhibiting ovarian cancer, the marker is selected from the 
markers in Table 3* 

The invention additionally provides a test method of assessing the breast 
or ovarian carcinogenic potential of a compound. This method comprises the steps of: 

a) maintaining separate aliquots of breast or ovarian cells in the presence 
and absence of the compound; and 

b) comparing expression of a marker of the invention in each of the 
aliquots. 

A significantly higher level of expression of the marker in the aliquot maintained in the 
presence of the compound, relative to that of the aliquot maintained in the absence of the 
compound, is an indication that the compound possesses breast or ovarian carcinogenic 
potential. In a preferred method for assessing breast carcinogenic potential, the marker is 
selected from the markers in Table 2. In a preferred method for assessing ovarian 
carcinogenic potential, the marker is selected from the markers in Table 3. 

In addition, the invention fiuther provides a method of inhibiting breast 
or ovarian cancer in a patient. This method comprises the steps of: 

a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a 
plurality of compositions; 

c) comparing expression of a marker of the invention in each of the 
aliquots; and 

d) administering to the patient at least one of the compositions which 
significantly lowers the level of expression of the marker in the aliquot 
containing that composition, relative to the levels of expression of the marker in 
the presence of the other compositions. 

In a preferred method for breast cancer, the marker is selected from the 
markers in Table 2. In a preferred method for ovarian cancer, the marker is selected from 
the markers in Table 3. 

In the aforementioned methods, the samples or patient samples can 
comprise a breast- or ovary-associated body fluid. Breast-associated fluids include, for 
example, blood fluids, lymph and cystic fluids, as well as nipple aspirates. Ovary- 



wo 03/000012 PCT/US02/19773 

.10- 

associated body fluids include, for example, blood fluids, lymph, ascites fluids, 
gynecological fluids, cystic fluids, urine, and fluids collected by peritoneal rinsing. The 
cells may be found in an ovarian or breast tissue sample collected, for example, by an 
ovarian or breast tissue biopsy or histology section. In another embodiment, the sample 
5 comprises cells obtained from the patient. In another embodiment, the patient sample is 
in vivo. 

According to the invention, the level of expression of a marker of the 
invention in a sample can be assessed, for example, by detecting the presence in the 
sample of: 

1 0 the corresponding marker protein (e.g. , a protein having one of the 

sequences of the even numbered SEQ ID NOs. such as SEQ ID NOs: 2, 4, 6, 8, 
etc.) or a fragment of the protein (e.g. by using a reagent, such as an antibody, an 
antibody derivative, an antibody fragment or single-chain antibody, which binds 
specifically with the protein or protein fragment) 

1 5 the corresponding marker nucleic acid (e.g. a nucleotide transcript having 

one of the sequences of the odd numbered SEQ ID NOs. such as SEQ ID NOs: 1, 
3, 5, 7, etc., or a complement thereof), or a fragment of the nucleic acid (e.g. by 
contacting transcribed polynucleotides obtained from the sample with a substrate 
having affixed thereto one or more nucleic acids having the entire or a segment 

20 of the sequence of any of the odd numbered SEQ ID NOs., or a complement 

thereof) 

a metabolite which is produced directly catalyzed) or indirectly by 
the corresponding marker protein. 

According to the invention, any of the aforementioned methods may be 

25 performed using a plurality (e.g. 2, 3, 5, or 1 0 or more) of breast or ovarian cancer 

markers, including breast or ovarian cancer markers known in the art. In such methods, 
the level of expression in the sample of each of a plurality of markers, at least one of 
which is a marker of the invention, is compared with the normal level of expression of 
each of the plurality of markers in samples of the same type obtained from control 

30 humans not afflicted with breast or ovarian cancer. A significantly altered (/.e., 
increased or decreased as specified in the above-described methods using a single 
marker) level of expression in the sample of one or more markers of the invention, or 
some combination thereof, relative to that marker's corresponding normal levels, is an 
indication that the patient is afflicted with breast or ovarian cancer. For all of the 

35 aforementioned methods, the marker(s) are preferably selected such that the positive 
predictive value of the method is at least about 10%. 
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In a further aspect, the invention provides an antibody, an antibody 
derivative, or an antibody fragment, which binds specifically with a marker protein (e.g., 
a protein having the sequence of any of the even numbered SEQ ID NOs.) or a fragment 
of the protein. The invention also provides methods for making such antibody, antibody 
S derivative, and antibody fragment. Such methods may comprise immimizing a mammal 
with a protein or peptide comprising the entirety, or a segment of 10 or more amino 
acids, of a marker protein (e.g. , a protein having the sequence of any of the even 
numbered SEQ ID NOs.), wherein the protein or peptide may be obtained from a cell or 
by chemical synthesis- The methods of the invention also encompass producing 

10 monoclonal and single-chain antibodies, which would further comprise isolating 
splenocytes from the immunized mammal, fusing the isolated splenocytes with an 
immortalized cell line to form hybridomas, and screening individual hybridomas for 
those that produce an antibody that binds specifically with a marker protein or a 
fragment of the protein. 

15 The markers of the invention are predicted to code for secreted or 

extracellular proteins, as well as for other types of transmembrane proteins (e.g., integral 
membrane proteins, type I and type II transmembrane proteins, multi-transmembrane 
proteins), and are therefore attractive targets foranticancer therapy and detection 
techniques, e.g., using antibodies and derviatives. Thus, markers of Table 2 are useful 

20 targets for detecting and treating breast cancer cancers and markers of Table 3 are useful 
targets for detecting and treating ovarian cancer. Further, certain markers of the 
invention (listed in Table 4) are selectively expressed in multiple types of cancers and 
thus are useful targets for detecting and treating several types of cancers. Table 4 
indicates the usefulness of a marker as a target for a specific type of cancer with a plus 

25 sign in that cancer's column. In one embodiment. Markers 1 , 2, 3, 26 and 32 each can 
be used as a target for diagnosis and treatment of breast and lung cancers. In another 
embodiment. Markers 6, 23, 43 and 47 each can be used as a target for diagnosis and 
treatment of ovarian, breast, lung and colon cancers. In a further embodiment. Markers 
5 and 7 each can be used as a target for diagnosis and treatment of ovarian, breast, lung, 

30 colon and prostate cancers. In a further embodiment, Markers 5 and 7 each can be used 
as a target for diagnosis and treatment of ovarian, breast, lung, colon and prostate 
cancers. In yet another embodiment, Marker 22 can be used as a target for diagnosis and 
treatment of breast, lung and colon cancers. In another embodiment. Marker 36 can be 
used as a target for diagnosis and treatment of ovarian, breast and lung, cancers. In a 

35 further additional embodiment, Marker 39 can be used as a target for diagnosis and 
treatment of ovarian and lung cancers. In yet a further embodiment, Marker 45 can be 
used as a target for diagnosis and treatment of ovarian and colon cancers. In another 
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additional embodiment. Marker 56 can be used as a target for diagnosis and treatment of 
ovarian lung and colon cancers. In a preferred embodiment of the invention. Marker 7 
and Marker 32 can be used as targets for inhibiting angiogenenis associated with tumor 
growth. Antibodies, antibody derivatives, and antibody fragments which bind 
5 specifically with a marker protein of the invention (/.c, a protein comprising the 

sequence of any of the even numbered) or a fragment of the protein, may thus be used to 
treat a cancer of which the corresponding marker is a target. 

In another aspect, the invention relates to various diagnostic and test kits. 
In one embodiment, the invention provides a kit for assessing whether a patient is 

10 afflicted with breast or ovarian cancer. The kit comprises a reagent for assessing 

expression of a marker of the invention. In another embodiment, the invention provides 
a kit for assessing the suitability of a chemical or biologic agent for inhibiting an breast 
or ovarian cancer in a patient. Such kit comprises a reagent for assessing expression of a 
marker of the invention, and may also comprise one or more of such agents. In a further 

15 embodiment, the invention provides kits for assessing the presence of breast or ovarian 
cancer cells or treating breast or ovarian cancers. Such kits comprise an antibody, an 
antibody derivative, or an antibody fragment, which binds specifically with a marker 
protein, or a fragment of the protein. Such kits may also comprise a plurality of 
antibodies, antibody derivatives, or antibody fragments wherein the plurality of such 

20 antibody agents binds specifically with a marker protein, or a fragment of the protein. 

In an additional embodiment, the invention also provides a kit for 
assessing the presence of breast or ovarian cancer cells, wherein the kit comprises a 
nucleic acid probe that binds specifically with a marker nucleic acid or a fragment of the 
nucleic acid. The kit may also comprise a plurality of probes, wherein each of the 

25 probes binds specifically with a marker nucleic acid, or a fragment of the nucleic acid. 

In a further aspect, the invention relates to methods for treating a patient 
afflicted with cancer, particularly breast or ovarian cancer or at risk of developing such a 
cancer. The methods may comprise reducing the expression and/or interfering with the 
biological function of a marker of the invention so as to treat a cancer of which the 

30 marker has been identified herein as a useful diagnosis and therapeutic target. In one 
embodiment, the method comprises providing to the patient an antisense oligonucleotide 
or polynycleotide complementary to a marker nucleic acid, or a segment thereof For 
example, an antisense polynucleotide may be provided to the patient through the 
delivery of a vector that expresses an anti-sense polynucleotide of a marker nucleic acid 

35 or a fragment thereof In another embodiment, the method comprises providing to the 
patient an antibody, an antibody derivative, or antibody fragment, which binds 
specifically with a marker protein or a fragment of the protein. In a preferred 
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embodiment, the antibody, antibody derivative or antibody fragment binds specifically 
with a protein having the sequence of an even numbered SEQ ID NO., or a fragment of 
the protein. 

It will be appreciated that the methods and kits of the present invention 
5 may also include known cancer markers including known breast or ovarian cancer 
markers. It will further be appreciated that the methods and kits may be used to identify 
cancers other than breast or ovarian cancer. 

DETAILED DESCRIPTION OF THE INVENTION 

10 The invention relates to newly discovered Markers 1-56 (Table 1) 

associated with cancer and more particularly the cancerous state of breast and/or ovarian 
cells. Table 1 lists the markers of the invention, which are over-expressed in breast 
and/or ovarian cancer cells compared to normal (/.e., non-cancerous) cells and provides 
the sequence listing identifiers of the cDNA sequence of a nucleotide transcript and the 

15 amino acid sequence of a protein encoded by or corresponding to each marker. It has 
been discovered that higher than normal level of expression of any of Markers 1-33 
(Table 2) or a combination of these markers correlates with the presence of cancer, 
particularly breast cancer in a patient. Likewise, it has been discovered that higher than 
normal level of expression of any of Markers 34-56 (Table 3) or a combination of these 

20 markers correlates with the presence of cancer, particularly ovarian cancer in a patient. 
Methods are provided for detecting the presence of cancer, particularly breast or ovarian 
cancer in a sample, the absence of breast or ovarian cancer in a sample, the stage of a 
breast or ovarian cancer, and with other characteristics of breast or ovarian cancer that 
are relevant to prevention, diagnosis, characterization, and therapy of breast or ovarian 

25 cancer in a patient Methods of treating cancer, particularly breast or ovarian cancer are 
also provided. 

Definitions 

As used herein, each of the following terms has the meaning associated 
30 with it in this section. 

The articles "a" and "an" are used herein to refer to one or to more than 
one (i.e. to at least one) of the grammatical object of the article. By way of example, "an 
element" means one element or more than one element. 

A "marker" is a gene whose altered level of expression in a tissue or cell 
35 fi-om its expression level in normal or healthy tissue or cell is associated with a disease 
state, such as cancer. A "marker nucleic acid" is a nucleic acid (e.g., mRNA, cDNA) 
encoded by or corresponding to a marker of the invention. Such marker nucleic acids 
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include DNA (e.g. , cDNA) comprising the entire or a partial sequence of any of the odd 
number SEQ ID NOs. or the complement of such a sequence. The marker nucleic acids 
also include RNA comprising the entire or a partial sequence of any odd number SEQ 
ID NO. or the complement of such a sequence, wherein all thymidine residues are 
5 replaced with uridine residues. A "marker protein" is a protein encoded by or 

corresponding to a marker of the invention. A marker protein comprises the entire or a 
partial sequence of any of the even numbered SEQ ID NOs. The terms "protein" and 
"polypeptide' are used interchangeably. 

The term "probe" refers to any molecule which is capable of selectively 

10 binding to a specifically intended target molecule, for example, a nucleotide transcript or 
protein encoded by or corresponding to a marker. Probes can be either synthesized by 
one skilled in the art, or derived firom appropriate biological preparations. For purposes 
of detection of the target molecule, probes may be specifically designed to be labeled, as 
described herein. Examples of molecules that can be utilized as probes include, but are 

1 5 not limited to, RNA, DNA, proteins, antibodies, and organic molecules. 

A "breast-associated" body fluid is a fluid which, when in the body of a 
patient, contacts or passes through breast cells or into which cells or proteins shed from 
breast cells are capable of passing. Exemplary breast-associated body fluids include, for 
example, blood fluids, lymph and cystic fluids, as well as nipple aspirates. 

20 An "ovarian-associated" body fluid is a fluid which, when in the body of 

a patient contacts or passes through ovarian cells or into which cells or proteins shed 
from ovarian cells are capable of passing. Ovary-associated body fluids include, for 
example, fluids include blood fluids (e.g. whole blood, blood serum, blood having 
platelets removed therefrom, etc.), lymph, ascitic fluids, gynecological fluids {e.g. 

25 ovarian, fallopian, and uterine secretions, menses, vaginal douching fluids, fluids used to 
rinse ovarian cell samples, etc.), cystic fluid, urine, fluids collected by peritoneal rinsing 
{e,g. fluids applied and collected during laparoscopy or fluids instilled into and 
withdrawn from the peritoneal cavity of a human patient), a fluid collected by uterine 
rinsing, a uterine fluid, a uterine exudate or menses, a pleural fluid, or an ovarian 

30 exudate. 

The "normal" level of expression of a marker is the level of expression of 
the marker in breast or ovarian cells of a human subject or patient not afflicted with 
breast or ovarian cancer 

An "over-expression'' or "significantly higher level of expression" of a 
35 marker refers to an expression level in a test sample that is greater than the standard 
error of the assay employed to assess expression, and is preferably at least twice, and 
more preferably three, four, five or ten times the expression level of the marker in a 
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control sample (e.g., sample from a healthy subjects not having the marker associated 
disease) and preferably, the average expression level of the marker in several control 
samples. 

A "significantly lower level of expression" of a marker refers to an 
5 expression level in a test sample that is at least twice, and more preferably three, four, 
five or ten times lower than the expression level of the marker in a control sample (e.g., 
sample firom a healthy subjects not having the marker associated disease) and preferably, 
the average expression level of the marker in several control samples. 

As used herein, the term "promoter/regulatory sequence" means a nucleic 
10 acid sequence which is required for expression of a gene product operably linked to the 
promoter/regulatory sequence. In some instances, this sequence may be the core 
promoter sequence and in other instances, this sequence may also include an enhancer 
sequence and other regulatory elements which are required for expression of the gene 
product. The promoter/regulatory sequence may, for example, be one which expresses 
15 the gene product in a tissue-specific manner. 

A "constitutive" promoter is a nucleotide sequence which, when operably 
linked with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell under most or all physiological conditions 
of the cell. 

20 An "inducible" promoter is a nucleotide sequence which, when operably 

linked with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell substantially only when an inducer which 
corresponds to the promoter is present in the cell. 

A ''tissue-specific" promoter is a nucleotide sequence which, when 

25 operably linked with a polynucleotide which encodes or specifies a gene product, causes 
the gene product to be produced in a living human cell substantially only if the cell is a 
cell of the tissue type corresponding to the promoter. 

A "transcribed polynucleotide" or "nucleotide transcript" is a 
polynucleotide {e.g. an mRNA, hnRNA, a cDNA, or an analog of such RNA or cDNA) 

30 which is complementary to or homologous with all or a portion of a mature mRNA 
made by transcription of a marker of the invention and normal post-transcriptional 
processing (e.g. splicing), if any, of the RNA transcript, and reverse transcription of the 
RNA transcript. 

"Complementary" refers to the broad concept of sequence 

35 complementarity between regions of two nucleic acid strands or between two regions of 
the same nucleic acid strand. It is known that an adenine residue of a first nucleic acid 
region is capable of forming specific hydrogen bonds ("base pairing") with a residue of a 
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second nucleic acid region which is antiparallel to the first region if the residue is 
thymine or uracil. Similarly, it is known that a cytosine residue of a first nucleic acid 
strand is capable of base pairing with a residue of a second nucleic acid strand which is 
antiparallel to the first strand if the residue is guanine. A first region of a nucleic acid is 
5 complementary to a second region of the same or a different nucleic acid if, when the . 
two regions are arranged in an antiparallel fashion, at least one nucleotide residue of the 
first region is capable of base pairing with a residue of the second region. Preferably, 
the first region comprises a first portion and the second region comprises a second 
portion, whereby, when the first and second portions are arranged in an antiparallel 
10 fashion, at least about 50%, and preferably at least about 75%, at least about 90%, or at 
least about 95% of the nucleotide residues of the first portion are capable of base pairing 
with nucleotide residues in the second portion. More preferably, all nucleotide residues 
of the first portion are capable of base pairing with nucleotide residues in the second 
portion. 

1 5 "Homologous" as used herein, refers to nucleotide sequence similarity 

between two regions of the same nucleic acid strand or between regions of two different 
nucleic acid strands. When a nucleotide residue position in both regions is occupied by 
the same nucleotide residue, then the regions are homologous at that position. A first 
region is homologous to a second region if at least one nucleotide residue position of 

20 each region is occupied by the same residue. Homology between two regions is 

expressed in terms of the proportion of nucleotide residue positions of the two regions 
that are occupied by the same nucleotide residue. By way of example, a region having 
the nucleotide sequence 5 -ATTGCC-3' and a region having the nucleotide sequence 5*- 
TATGGC-3' share 50% homology. Preferably, the first region comprises a first portion 

25 and the second region comprises a second portion, whereby, at least about 50%, and 
preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide 
residue positions of each of the portions are occupied by the same nucleotide residue. 
More preferably, all nucleotide residue positions of each of the portions are occupied by 
the same nucleotide residue. 

30 A molecule is "fixed" or "affixed" to a substrate if it is covalently or non- 

covalently associated with the substrate such the substrate can be rinsed widi a fluid (e.g. 
standard saline citrate, pH 7.4) without a substantial fi-action of the molecule 
dissociating from the substrate. 

As used herein, a "naturally-occurring" nucleic acid molecule refers to an 

35 RNA or DNA molecule having a nucleotide sequence that occurs in an organism found 
in nature. 



wo 03/000012 



-17- 



PCT/US02/19773 



A cancer is "inhibited" if at least one symptom of the cancer is alleviated, 
terminated, slowed, or prevented. As used herein, breast or ovarian cancer is also 
"inhibited" if recurrence or metastasis of the cancer is reduced, slowed, delayed, or 
prevented. 

5 A kit is any manufacture (e.g. a package or container) comprising at least 

one reagent, e.g, a probe, for specifically detecting the expression of a marker of the 
invention. The kit may be promoted, distributed, or sold as a unit for performing the 
methods of the present invention. 

"Proteins of the invention" encompass marker proteins and their 

10 fragments; variant marker proteins and their fragments; peptides and polypeptides 

comprising an at least 1 5 amino acid segment of a marker or variant marker protein; and 
fusion proteins comprising a marker or variant marker protein, or an at least 15 amino 
acid segment of a marker or variant marker protein. 

Unless otherwise specified herewithin, the terms "antibody" and 

15 "antibodies" broadly encompass naturally-occurring forms of antibodies {e.g., IgG, IgA, 
IgM, IgE) and recombinant antibodies such as single-chain antibodies, chimeric and 
humanized antibodies and multi-specific antibodies, as well as fragments and derivatives 
of all of the foregoing, which fragments and derivatives have at least an antigenic 
binding site. Antibody derivatives may comprise a protein or chemical moiety 

20 conjugated to an antibody moiety. 

Description 

The present invention is based, in part, on newly identified markers 
which are over-expressed in breast or ovarian cancer cells as compared to their 
25 expression in normal {le, non-cancerous) breast or ovarian cells. The enhanced 

expression of one or more of these markers in breast or ovarian cells is herein correlated 
with the cancerous state of the tissue. The invention provides compositions, kits, and 
methods for assessing the cancerous state of breast or ovarian cells (e.g. cells obtained 
from a human, cultured human cells, archived or preserved human cells and in vivo 
30 cells) as well as treating patients afnicted with breast or ovarian cancer. 

The compositions, kits, and methods of the invention have the follov^ng 
uses, among others: 

1 ) assessing whether a patient is afflicted with breast or ovarian 
cancer; 

35 2) assessing the stage of breast or ovarian cancer in a human patient; 

3) assessing the grade of breast or ovarian cancer in a patient; 
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4) assessing the benign or malignant nature of breast or ovarian 
cancer in a patient; 

5) assessing the metastatic potential of breast or ovarian cancer in a 
patient; 

5 6) assessing the histological type of neoplasm associated with breast 

or ovarian cancer in a patient; 

7) making antibodies, antibody fragments or antibody derivatives 
that are useful for treating breast or ovarian cancer and/or 
assessing whether a patient is afflicted with breast or ovarian 

10 • cancer; 

8) assessing the presence of breast or ovarian cancer cells; 

9) assessing the efficacy of one or more test compounds for 
inhibiting breast or ovarian cancer in a patient; 

10) assessing the efficacy of a therapy for inhibiting breast or ovarian 
15 cancer in a patient; 

1 1 ) monitoring the progression of breast or ovarian cancer in a 
patient; 

1 2) selecting a composition or therapy for inhibiting breast or ovarian 
cancer in a patient; 

20 1 3) treating a patient afflicted with breast or ovarian cancer; 

14) inhibiting breast or ovarian cancer in a patient; 

1 5) assessing the breast or ovarian carcinogenic potential of a test 
compound; and 

1 6) preventing the onset of breast or ovarian cancer in a patient at risk 
25 for developing breast or ovariacancer. 

The invention thus includes a method of assessing whether a patient is 
afflicted with breast or ovarian cancer which includes assessing whether the patient has 
pre-metastasized breast or ovarian cancer. This method comprises comparing the level 
of expression of a marker of the invention in a patient sample and the normal level of 
30 expression of the marker in a control, e.g., a non-cancerous breast or ovarian sample. A 
significantly higher level of expression of the marker in the patient sample as compared 
to the normal level is an indication that the patient is afflicted with breast or ovarian 
cancer. 

Gene delivery vehicles, host cells and compositions (all described herein) 
35 containing nucleic acids comprising the entirety, or a segment of 1 5 or more nucleotides, 
of any of the sequences of the odd numbered SEQ ID NOs. or the complement of such 
sequences, and polypeptides comprising the entirety, or a segment of 10 or more amino 
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acids, of any of the sequences of the even numbered SEQ ID NOs. are also provided by 
this invention. 

As described herein, breast or ovarian cancer in patients is associated 
with an increased level of expression of one or more markers of the invention. While, as 
5 discussed above, some of these changes in expression level result from occurrence of the 
breast or ovarian cancer, others of these changes induce, maintain, and promote the 
cancerous state of breast or ovarian cancer cells. Thus, breast or ovarian cancer 
characterized by an increase in the level of expression of one or more markers of the 
invention can be inhibited by reducing and/or interfering with the expression of the 

10 markers and/or function of the proteins encoded by those markers. 

Expression of a marker of the invention can be inhibited in a number of 
ways generally known in the art. For example, an antisense oligonucleotide can be 
provided to the breast or ovarian cancer cells in order to inhibit transcription, translation, 
or both, of the marker(s). Alternately, a polynucleotide encoding an antibody, an 

15 antibody derivative, or an antibody fragment which specifically binds a marker protein, 
and operably linked with an appropriate promoter/regulator region, can be provided to 
the cell in order to generate intracellular antibodies which will inhibit the function or 
activity of the protein. The expression and/or function of a marker may also be inhibited 
by treating the breast or ovarian cancer cell with an antibody, antibody derivative or 

20 antibody fragment that specifically binds a marker protein. Using the methods described 
herein, a variety of molecules, particularly including molecules sufficiently small that 
they are able to cross the cell membrane, can be screened in order to identify molecules 
which inhibit expression of a marker or inhibit the function of a marker protein. The 
compound so identified can be provided to the patient in order to inhibit breast or 

25 ovarian cancer cells of the patient. 

Any marker or combination of markers of the invention, as well as any 
known markers in combination with the markers of the invention, may be used in the 
compositions, kits, and methods of the present invention. In general, it is preferable to 
use markers for which the difference between the level of expression of the marker in 

30 breast or ovarian cancer cells and the level of expression of the same marker in normal 
breast or ovarian cells is as great as possible. Although this difference can be as small as 
the limit of detection of the method for assessing expression of the marker, it is preferred 
that the difference be at least greater than the standard error of the assessment method, 
and preferably a difference of at least 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 15-, 20-, 25-, 100-, 

35 500-, 1 000-fold or greater than the level of expression of the same marker in normal 
breast or ovarian tissue. 
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The marker proteins of the present invention are transmembrane proteins 
and are therefore extremely useful in the compositions, kits, and methods of the 
invention, owing to the fact that the such marker proteins can be detected in a breast or 
ovary-associated body fluid sample, which may be more easily collected from a human 
S patient than a tissue biopsy sample. In addition, preferred in vivo techniques for 
detection of a marker protein include introducing into a subject a labeled antibody 
directed against the protein. For example, the antibody can be labeled with a radioactive 
marker whose presence and location in a subject can be detected by standard imaging 
techniques. Anti-cancer therapy utilizing antibodies directed against the marker proteins 
1 0 of the present invention is also provided. In particular, it has been found that Markers 7 
and 32 are attractive targets for inhibiting breast, ovary, lung and colon tumors, as well 
as for inhibiting angiogenesis associated with tumor growth. 

It will be appreciated that patient samples containing breast or ovarian 
cells may be used in the methods of the present invention. In these embodiments, the 
15 level of expression of the marker can be assessed by assessing the amount (e.g. absolute 
amount or concentration) of the marker in a breast or ovarian cell sample, e.g., breast or 
ovarian tissue biopsy obtained from a patient. The cell sample can, of course, be 
subjected to a variety of well-known post-collection preparative and storage techniques 
(e,g., nucleic acid and/or protein extraction, fixation, storage, freezing, ultrafiltration, 
20 concentration, evaporation, centrifiigation, etc.) prior to assessing the amount of the 
marker in the sample. 

Expression of a marker of the invention may be assessed by any of a wide 
variety of well known methods for detecting expression of a transcribed nucleic acid or 
protein. Non-limiting examples of such methods include inununological methods for 
25 detection of secreted, cell-surface, cytoplasmic, or nuclear proteins, protein purification 
methods, protein function or activity assays, nucleic acid hybridization methods, nucleic 
acid reverse transcription methods, and nucleic acid amplification methods. 

In a preferred embodiment, expression of a marker is assessed using an 
antibody (e.g. a radio-labeled, chromophore-labeled, fluorophore-labeled, or enzyme- 
30 labeled antibody), an antibody derivative (e.g. an antibody conjugated with a substrate or 
with the protein or ligand of a protein-ligand pair {e.g. biotin-streptavidin} ), or an 
antibody fragment (e.g. a single-chain antibody, an isolated antibody hypervariable 
domain, etc.) which binds specifically with a marker protein or fragment thereof, 
including a marker protein which has undergone all or a portion of its normal post- 
35 translational modification. 
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In another preferred embodiment, expression of a marker is assessed by 
preparing mRNA/cDNA (i.e. a transcribed polynucleotide) from cells in a patient 
sample, and by hybridizing the mRNA/cDNA with a reference polynucleotide which is a 
complement of a marker nucleic acid, or a fragment thereof. cDNA can, optionally, be 
5 amplified using any of a variety of polymerase chain reaction methods prior to 
hybridization with the reference polynucleotide; preferably, it is not amplified. 
Expression of one or more markers can likewise be detected using quantitative PGR to 
assess the level of expression of the marker(s). Alternatively, any of the many known 
methods of detecting mutations or variants {e.g. single nucleotide polymorphisms, 

10 deletions, etc.) of a marker of the invention may be used to detect occurrence of a 
marker in a patient. 

In a related embodiment, a mixture of transcribed polynucleotides 
obtained from the sample is contacted with a substrate having fixed thereto a 
polynucleotide complementary to or homologous with at least a portion (e.g. at least 7, 

15 10, 1 5, 20, 25, 30, 40, 50, 100, 500, or more nucleotide residues) of a marker nucleic 
acid. If polynucleotides complementary to or homologous with are differentially 
detectable on the substrate (e.g. detectable using different chromophores or 
fluorophores, or fixed to different selected positions), then the levels of expression of a 
plurality of markers can be assessed simultaneously using a single substrate (e.g. a "gene 

20 chip" microarray of polynucleotides fixed at selected positions). When a method of 
assessing marker expression is used which involves hybridization of one nucleic acid 
with another, it is preferred that the hybridization be performed under stringent 
hybridization conditions. 

Because the compositions, kits, and methods of the invention rely on 

25 detection of a difference in expression levels of one or more markers of the invention, it 
is preferable that the level of expression of the marker is significantly greater than the 
minimum detection limit of the method used to assess expression in at least one of 
normal breast or ovarian cells and cancerous breast or ovarian cells. 

It is understood that by routine screening of additional patient samples 

30 using one or more of the markers of the invention, it will be realized that certain of the 
markers are over-expressed in cancers of various types, including specific breast or 
ovarian cancers, as well as other cancers such as lung cancer, colon cancer, etc. For 
example, it will be confirmed that some of the markers of the invention are over- 
expressed in most (i.e. 50% or more) or substantially all (i.e. 80% or more) of breast or 

35 ovarian cancers. Furthermore, it will be confirmed that certain of the markers of the 
invention are associated with breast cancer of various stages (i.e. stage 0, 1, II, III, and 
IV breast cancers, as well as subclassifications IIA, IIB, III A, and IIIB, using the FIGO 
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Stage Grouping system for primary carcinoma of the breast; (see Breast, In: American 
Joint Committee on Cancer: AJCC Cancer Staging Manual. Lippincott-Raven 
Publishers, 5th ed., 1997, pp. 171-180), or stage I, II, III, and IV ovarian cancers, as well 
as subclassifications lA, IB, IC, IIA, IIB, IIC, IIIA, IIIB, and IIIC, using the FIGO Stage 
5 Grouping system for primary carcinoma of the ovary; 1987, Am, J. Obstet. Gynecol. 
156:236, of various histologic subtypes (e.g. serous, mucinous, endometroid, and clear 
cell subtypes, as well as subclassifications and alternate classifications adenocarcinoma, 
papillary adenocarcinoma, papillary cystadenocarcinoma, surface papillary carcinoma, 
malignant adenofibroma, cystadenofibroma, adenocarcinoma, cystadenocarcinoma, 

10 adenoacanthoma, endometrioid stromal sarcoma, mesodermal (Mullerian) mixed tumor, 
mesonephroid tumor, malignant carcinoma, Brenner tumor, mixed epithelial tumor, and 
undifferentiated carcinoma, using the WHO/FIGO system for classification of malignant 
breast and ovarian tumors; ScxxWy, Atlas of Tumor Pathology, 3d series, Washington 
DC), and various grades {i.e. grade I {well differentiated} , grade II {moderately well 

15 differentiated}, and grade III {poorly differentiated from surrounding normal tissue})). 
In addition, as a greater number of patient samples are assessed for expression of the 
markers of the invention and the outcomes of the individual patients from whom the 
samples were obtained are correlated, it will also be confirmed that altered expression of 
certain of the markers of the invention are strongly correlated with malignant cancers 

20 and that altered expression of other markers of the invention are strongly correlated with 
benign tumors. The compositions, kits, and methods of the invention are thus useful for 
characterizing one or more of the stage, grade, histological type, and benign/malignant 
nature of breast or ovarian cancer in patients. 

When the compositions, kits, and methods of the invention are used for 

25 characterizing one or more of the stage, grade, histological type, and benign/malignant 
nature of breast or ovarian cancer in a patient, it is preferred that the marker or panel of 
markers of the invention is selected such that a positive result is obtained in at least 
about 20%, and preferably at least about 40%, 60%, or 80%, and more preferably in 
substantially all patients afflicted with a breast or ovarian cancer of the corresponding 

30 stage, grade, histological type, or benign/malignant nature. Preferably, the marker or 
panel of markers of the invention is selected such that a positive predictive value (PPV) 
of greater than about 10% is obtained for the general population (more preferably 
coupled with an assay specificity greater than 80%). 

When a plurality of markers of the invention are used in the 

35 compositions, kits, and methods of the invention, the level of expression of each marker 
in a patient sample can be compared with the normal level of expression of each of the 
plurality of markers in non-cancerous samples of the same type, either in a single 
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reaction mixture (i.e. using reagents, such as different fluorescent probes, for each 
marker) or in individual reaction mixtures corresponding to one or more of the markers. 
In one embodiment, a significantly increased level of expression of more than one of the 
plurality of markers in the sample, relative to the corresponding normal levels, is an 
5 indication that the patient is afflicted with breast or ovarian cancer. When a plurality of 
markers is used, it is preferred that 2, 3, 4, 5, 8, 10, 12, 15, 20, 30, or 50 or more 
individual markers be used, wherein fewer markers are preferred. 

In order to maximize the sensitivity of the compositions, kits, and 
methods of the invention (i.e. by interference attributable to cells of non-breast or 

10 ovarian origin in a patient sample), it is preferable that the marker of the invention used 
therein be a marker which has a restricted tissue distribution, e.g., normally not 
expressed in a non- breast or ovarian tissue. 

Only a small number of markers are known to be associated with breast 
or ovarian cancers (e.g., for breast: BRCAJ and BRCA2; and, for ovarian: AKT2, Ki- 

15 HAS, ERBB2, c-MYC, RBI, and TP 53). These markers are not, of course, included 
among the markers of the invention, although they may be used together with one or 
more markers of the invention in a panel of markers, for example. It is well known that 
certain types of genes, such as oncogenes, tumor suppressor genes, growth factor-like 
genes, protease-like genes, and protein kinase-like genes are often involved with 

20 development of cancers of various types. Thus, among the markers of the invention, use 
of those which correspond to proteins which resemble known proteins encoded by 
known oncogenes and tumor suppressor genes, and those which correspond to proteins 
which resemble growth factors, proteases, and protein kinases are preferred. 

It is recognized that the compositions, kits, and methods of the invention 

25 will be of particular utility to patients having an enhanced risk of developing breast or 
ovarian cancer and their medical advisors. Patients recognized as having an enhanced 
risk of developing breast or ovarian cancer include, for example, patients having a 
familial history of breast or ovarian cancer, patients identified as having a mutant 
oncogene (i.e. at least one allele), and patients of advancing age (i.e. women older than 

30 about 50 or 60 years). 

The level of expression of a marker in normal {i.e. non-cancerous) human 
breast or ovarian tissue can be assessed in a variety of ways. In one embodiment, this 
normal level of expression is assessed by assessing the level of expression of the marker 
in a portion of breast or ovarian cells which appears to be non-cancerous and by 

35 comparing this normal level of expression with the level of expression in a portion of the 
breast or ovarian cells which is suspected of being cancerous. Altemately, and 
particularly as further information becomes available as a result of routine performance 
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of the methods described herein, population-average values for normal expression of the 
markers of the invention may be used. In other embodiments, the 'normal' level of 
expression of a marker may be determined by assessing expression of the marker in a 
patient sample obtained from a non-cancer-afilicted patient, from a patient sample 
5 obtained from a patient before the suspected onset of breast or ovarian cancer in the 
patient, from archived patient samples, and the like. 

The invention includes compositions, kits, and methods for assessing the 
presence of breast or ovarian cancer cells in a sample (e.g. an archived tissue sample or a 
sample obtained from a patient). These compositions, kits, and methods are 

1 0 substantially the same as those described above, except that, where necessary, the 
compositions, kits, and methods are adapted for use with samples other than patient 
samples. For example, when the sample to be used is a parafinized, archived human 
tissue sample, it can be necessary to adjust the ratio of compounds in the compositions 
of the invention, in the kits of the invention, or the methods used to assess levels of 

15 marker expression in the sample. Such methods are well knovm in the art and within the 
skill of the ordinary artisan. 

The invention includes a kit for assessing the presence of breast or 
ovarian cancer cells (e.g. in a sample such as a patient sample). The kit comprises a 
plurality of reagents, each of which is capable of binding specifically with a marker 

20 nucleic acid or protein. Suitable reagents for binding with a marker protein include 

antibodies, antibody derivatives, antibody fragments, and the like. Suitable reagents for 
binding with a marker nucleic acid (e.g. a genomic DNA, an mRNA, a spliced mRNA, a 
cDNA, or the like) include complementary nucleic acids. For example, the nucleic acid 
reagents may include oligonucleotides (labeled or non-labeled) fixed to a substrate, 

25 labeled oligonucleotides not bound with a substrate, pairs of PGR primers, molecular 
beacon probes, and the like. 

The kit of the invention may optionally comprise additional components 
usefiil for performing the methods of the invention. By way of example, the kit may 
comprise fluids (e.g. SSC buffer) suitable for annealing complementary nucleic acids or 

30 for binding an antibody with a protein with which it specifically binds, one or more 
sample compartments, an instructional material which describes performance of a 
method of the invention, a sample of normal breast or ovarian cells, a sample of breast 
or ovarian cancer cells, and the like. 

The invention also includes a method of making an isolated hybridoma 

35 which produces an antibody useful for assessing whether patient is afflicted with an 
breast or ovarian cancer. In this method, a protein or peptide comprising the entirety or 
a segment of a marker protein is synthesized or isolated (e.g. by purification from a cell 
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in which it is expressed or by transcription and translation of a nucleic acid encoding the 
protein or peptide in vivo or in vitro using known methods). A vertebrate, preferably a 
mammal such as a mouse, rat, rabbit, or sheep, is inununized using the protein or 
peptide. The vertebrate may optionally (and preferably) be immunized at least one 
5 additional time with the protein or peptide, so that the vertebrate exhibits a robust 
immune response to the protein or peptide. Splenocytes are isolated from the 
immunized vertebrate and fused with an immortalized cell line to form hybridomas, 
using any of a variety of methods well known in the art. Hybridomas formed in this 
manner are then screened using standard methods to identify one or more hybridomas 

10 which produce an antibody which specifically binds with the marker protein or a 
fragment thereof. The invention also includes hybridomas made by this method and 
antibodies made using such hybridomas. 

The invention also includes a method of assessing the efficacy of a test 
compound for inhibiting breast or ovarian cancer cells. As described above, differences 

15 in the level of expression of the markers of the invention correlate with the cancerous 
state of breast or ovarian cells. Although it is recognized that changes in the levels of 
expression of certain of the markers of the invention likely result from the cancerous 
state of breast or ovarian cells, it is likewise recognized that changes in the levels of 
expression of other of the markers of the invention induce, maintain, and promote the 

20 cancerous state of those cells. Thus, compounds which inhibit an breast or ovarian 

cancer in a patient will cause the level of expression of one or more of the markers of the 
invention to change to a level nearer the normal level of expression for that marker (Le. 
the level of expression for the marker in non-cancerous breast or ovarian cells). 

This method thus comprises comparing expression of a marker in a first 

25 breast or ovarian cell sample and maintained in the presence of the test compound and 
expression of the marker in a second breast or ovarian cell sample and maintained in the 
absence of the test compound. A significantly reduced expression of a marker of the 
invention in the presence of the test compound is an indication that the test compound 
inhibhs breast or ovarian cancer. The breast or ovarian cell samples may, for example, 

30 be aliquots of a single sample of normal breast or ovarian cells obtained from a patient, 
pooled samples of normal breast or ovarian cells obtained from a patient, cells of a 
normal breast or ovarian cell line, aliquots of a single sample of breast or ovarian cancer 
cells obtained from a patient, pooled samples of breast or ovarian cancer cells obtained 
from a patient, cells of an breast or ovarian cancer cell line, or the like. In one 

35 embodiment, the samples are breast or ovarian cancer cells obtained from a patient and a 
plurality of compounds known to be effective for inhibiting various breast or ovarian 
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cancers are tested in order to identify the compound which is likely to best inhibit the 
breast or ovarian cancer in the patient. 

This method may likewise be used to assess the efficacy of a therapy for 
inhibiting breast or ovarian cancer in a patient. In this method, the level of expression of 
5 one or more markers of the invention in a pair of samples (one subjected to the therapy, 
the other not subjected to the therapy) is assessed. As with the method of assessing the 
efficacy of test compounds, if the therapy induces a significantly lower level of 
expression of a marker of the invention then the therapy is efficacious for inhibiting 
breast or ovarian cancer. As above, if samples from a selected patient are used in this 

10 method, then alternative therapies can be assessed in vitro in order to select a therapy 
most likely to be efficacious for inhibiting breast or ovarian cancer in the patient. 

As described above, the cancerous state of human breast or ovarian cells 
is correlated with changes in the levels of expression of the markers of the invention. 
The invention includes a method for assessing the human breast or ovarian cell 

1 5 carcinogenic potential of a test compound. This method comprises maintaining separate 
aliquots of human breast or ovarian cells in the presence and absence of the test 
compound. Expression of a marker of the invention in each of the aliquots is compared. 
A significantly higher level of expression of a marker of the invention in the aliquot 
maintained in the presence of the test compound (relative to the aliquot maintained in 

20 the absence of the test compound) is an indication that the test compound possesses 

human breast or ovarian cell carcinogenic potential. The relative carcinogenic potentials 
of various test compounds can be assessed by comparing the degree of enhancement or 
inhibition of the level of expression of the relevant markers, by comparing the number of 
markers for which the level of expression is enhanced or inhibited, or by comparing 

25 both. 

Various aspects of the invention are described in fiirther detail in the 
following subsections. 

L Isolated Nucleic Acid Molecules 

30 One aspect of the invention pertains to isolated nucleic acid molecules, 

including nucleic acids which encode a marker protein or a portion thereof Isolated 
nucleic acids of the invention also include nucleic acid molecules sufficient for use as 
hybridization probes to identify marker nucleic acid molecules, and fragments of marker 
nucleic acid molecules, e.g., those suitable for use as PGR primers for the amplification 

35 or mutation of marker nucleic acid molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA) and 
RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using 
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nucleotide analogs. The nucleic acid molecule can be single-stranded or double- 
stranded, but preferably is double-stranded DNA. 

An "isolated" nucleic acid molecule is one which is separated from other 
nucleic acid molecules which are present in the natural source of the nucleic acid 
5 molecule. Preferably, an "isolated" nucleic acid molecule is free of sequences 
(preferably protein-encoding sequences) which naturally flank the nucleic acid (/.e., 
sequences located at the 5' and 3* ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various embodiments, 
the isolated nucleic acid molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 

10 kB, 0.5 kB or 0. 1 kB of nucleotide sequences which naturally flank the nucleic acid 

molecule in genomic DNA of the cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free 
of other cellular material, or culture medium when produced by recombinant techniques, 
or substantially free of chemical precursors or other chemicals when chemically 

15 synthesized, 

A nucleic acid molecule of the present invention can be isolated using 
standard molecular biology techniques and the sequence information in the database 
records described herein. Using all or a portion of such nucleic acid sequences, nucleic 
acid molecules of the invention can be isolated using standard hybridization and cloning 
20 techniques (e.g., as described in Sambrook et al, ed.. Molecular Cloning: A Laboratory 
Manual, 2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). 

A nucleic acid molecule of the invention can be amplified using cDNA, 
mRNA, or genomic DNA as a template and appropriate oligonucleotide primers 
25 according to standard PCR amplification techniques. The nucleic acid so amplified c£m 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, nucleotides corresponding to all or a portion of a nucleic acid molecule of 
the invention can be prepared by standard synthetic techniques, e.g., using an automated 
DNA synthesizer. 

30 In another preferred embodiment, an isolated nucleic acid molecule of the 

invention comprises a nucleic acid molecule which has a nucleotide sequence 
complementary to the nucleotide sequence of a marker nucleic acid or to the nucleotide 
sequence of a nucleic acid encoding a marker protein. A nucleic acid molecule which is 
complementary to a given nucleotide sequence is one which is sufficiently 

35 complementary to the given nucleotide sequence that it can hybridize to the given 
nucleotide sequence thereby forming a stable duplex. 
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Moreover, a nucleic acid molecule of the invention can comprise only a 
portion of a nucleic acid sequence, wherein the full length nucleic acid sequence 
comprises a marker nucleic acid or which encodes a marker protein. Such nucleic acids 
can be used, for example, as a probe or primer. The probe/primer typically is used as 
5 one or more substantially purified oligonucleotides. The oligonucleotide typically 
comprises a region of nucleotide sequence that hybridizes under stringent conditions to 
at least about 7, preferably about 15, more preferably about 25, 50, 75, 100, 125, 150, 
175, 200, 250, 300, 350, or 400 or more consecutive nucleotides of a nucleic acid of the 
invention. 

10 Probes based on the sequence of a nucleic acid molecule of the invention 

can be used to detect transcripts or genomic sequences corresponding to one or more 
markers of the invention. The probe comprises a label group attached thereto, e.g., a 
radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes 
can be used as part of a diagnostic test kit for identifying cells or tissues which mis- 

15 express the protein, such as by measuring levels of a nucleic acid molecule encoding the 
protein in a sample of cells from a subject, e.g., detecting mRNA levels or determining 
whether a gene encoding the protein has been mutated or deleted. 

The invention further encompasses nucleic acid molecules that differ, due 
to degeneracy of the genetic code, from the nucleotide sequence of nucleic acids 

20 encoding a marker protein {e.g. , protein having the sequence of the even numbered SEQ 
ID NOs.), and thus encode the same protein. 

It will be appreciated by those skilled in the art that DNA sequence 
polymorphisms that lead to changes in the amino acid sequence can exist within a 
population (e.g., the human population). Such genetic polymorphisms can exist among 

25 individuals within a population due to natural allelic variation. An allele is one of a 
group of genes which occur alternatively at a given genetic locus. In addition, it will be 
appreciated that DNA polymorphisms that affect RNA expression levels can also exist 
that may affect the overall expression level of that gene (eg., by affecting regulation or 
degradation). 

30 As used herein, the phrase "allelic variant" refers to a nucleotide sequence 

which occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. 

As used herein, the terms "gene" and "recombinant gene" refer to nucleic 
acid molecules comprising an open reading frame encoding a polypeptide corresponding 
to a marker of the invention. Such natural allelic variations can typically result in 1-5% 

35 variance in the nucleotide sequence of a given gene. Alternative alleles can be identified 
by sequencing the gene of interest in a number of different individuals. This can be 
readily carried out by using hybridization probes to identify the same genetic locus in a 
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variety of individuals. Any and all such nucleotide variations and resulting amino acid 
polymorphisms or variations that are the result of natural allelic variation and that do not 
alter the functional activity are intended to be within the scope of the invention. 

In another embodiment, an isolated nucleic acid molecule of the 
5 invention is at least 7, 15, 20, 25, 30, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 
550, 650, 700, 800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 
3000, 3500, 4000, 4500, or more nucleotides in length and hybridizes under stringent 
conditions to a marker nucleic acid or to a nucleic acid encoding a marker protein. As 
used herein, the term "hybridizes under stringent conditions" is intended to describe 

10 conditions for hybridization and washing under which nucleotide sequences at least 60% 
(65%, 70%, preferably 75%) identical to each other typically remain hybridized to each 
other. Such stringent conditions are known to those skilled in the art and can be found 
in sections 6.3.1-6.3.6 of Current Protocols in Molecular Biology, John Wiley & Sons, 
N.Y. (1989). A preferred, non-limiting example of stringent hybridization conditions 

15 are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed 
by one or more washes in 0.2X SSC, 0.1% SDS at 50-65^C. 

In addition to naturally-occurring allelic variants of a nucleic acid 
molecule of the invention that can exist in the population, the skilled artisan will further 
appreciate that sequence changes can be introduced by mutation thereby leading to 

20 changes in the amino acid sequence of the encoded protein, without altering the 
biological activity of the protein encoded thereby. For example, one can make 
nucleotide substitutions leading to amino acid substitutions at "non-essential" amino 
acid residues. A "non-essential" amino acid residue is a residue that can be altered from 
the wild-type sequence without altering the biological activity, whereas an "essential" 

25 amino acid residue is required for biological activity. For example, amino acid residues 
that are not conserved or only semi-conserved among homologs of various species may 
be non-essential for activity and thus would be likely targets for alteration. 
Alternatively, amino acid residues that are conserved among the homologs of various 
species (e.g., murine and human) may be essential for activity and thus would not be 

30 likely targets for alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid 
molecules encoding a variant marker protein that contain changes in amino acid residues 
that are not essential for activity. Such variant marker proteins differ in amino acid 
sequence from the naturally-occurring marker proteins, yet retain biological activity. In 

35 one embodiment, such a variant marker protein has an amino acid sequence that is at 
least about 40% identical, 50%, 60%, 70%, 80%, 90%, 95%, or 98% identical to the 
amino acid sequence of a marker protein. 
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An isolated nucleic acid molecule encoding a variant marker protein can 
be created by introducing one or more nucleotide substitutions, additions or deletions 
into the nucleotide sequence of marker nucleic acids, such that one or more amino acid 
residue substitutions, additions, or deletions are introduced into the encoded protein. 
5 Mutations can be introduced by standard techniques, such as site-directed mutagenesis 
and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are 
made at one or more predicted non-essential amino acid residues. A "conservative 
amino acid substitution" is one in which the amino acid residue is replaced with an 
amino acid residue having a similar side chain. Families of amino acid residues having 

10 similar side chains have been defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic 
acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, 
serine, threonine, tyrosine, cysteine), non-polar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 

1 5 (e.g. , threonine, valine, isoleucine) and aromatic side chains (e.g. , tyrosine, 

phenylalanine, tryptophan, histidine). Alternatively, mutations can be introduced 
randomly along all or part of the coding sequence, such as by saturation mutagenesis, 
and the resultant mutants can be screened for biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded protein can be expressed 

20 recombinantly and the activity of the protein can be determined. 

The present invention encompasses antisense nucleic acid molecules, i.e., 
molecules which are complementary to a sense nucleic acid of the invention, e.g., 
complementary to the coding strand of a double-stranded marker cDNA molecule or 
complementary to a marker mRNA sequence. Accordingly, an antisense nucleic acid of 

25 the invention can hydrogen bond to (i.e. anneal with) a sense nucleic acid of the 

invention. The antisense nucleic acid can be complementary to an entire coding strand, 
or to only a portion thereof, e.g., all or part of the protein coding region (or open reading 
fi-ame). An antisense nucleic acid molecule can also be antisense to all or part of a non- 
coding region of the coding strand of a nucleotide sequence encoding a marker protein. 

30 The non-coding regions ("5* and 3* untranslated regions") are the 5' and 3* sequences 
which flank the coding region and are not translated into amino acids. 

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 
30, 35, 40, 45, or 50 or more nucleotides in length. An antisense nucleic acid of the 
invention can be constructed using chemical synthesis and enzymatic ligation reactions 

35 using procedures known in the art. For example, an antisense nucleic acid (e.g. , an 
antisense oligonucleotide) can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the biological 
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Stability of the molecules or to increase the physical stability of the duplex formed 
between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and 
acridine substituted nucleotides can be used. Examples of modified nucleotides which 
can be used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 
5 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- 

(carboxyhydroxylmethyl) uracil, 5"CarboxymethylaminomethyI-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methyiguanine, 3-methyicytosine, 5-methylcytosine, N6-adenine, 7- 

10 methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 

15 2-thiouracii, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 
Alternatively, the antisense nucleic acid can be produced biologically using an 
expression vector into which a nucleic acid has been sub-cloned in an antisense 
orientation (z.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 

20 subsection). 

The antisense nucleic acid molecules of the invention are typically 
administered to a subject or generated in situ such that they hybridize with or bind to 
cellular mRNA and/or genomic DNA encoding a marker protein to thereby inhibit 
expression of the marker, e.g. , by inhibiting transcription and/or translation. The 

25 hybridization can be by conventional nucleotide complementarity to form a stable 

duplex, or, for example, in the case of an antisense nucleic acid molecule which binds to 
DNA duplexes, through specific interactions in the major groove of the double helix. 
Examples of a route of administration of antisense nucleic acid molecules of the 
invention includes direct injection at a tissue site or infusion of the antisense nucleic acid 

30 into a breast-or ovary- associated body fluid. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and then administered systemically. 
For example, for systemic administration, antisense molecules can be modified such that 
they specifically bind to receptors or antigens expressed on a selected cell surface, e.g, 
by linking the antisense nucleic acid molecules to peptides or antibodies which bind to 

35 cell surface receptors or antigens. The antisense nucleic acid molecules can also be 
delivered to cells using the vectors described herein. To achieve sufficient intracellular 
concentrations of the antisense molecules, vector constructs in which the antisense 
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nucleic acid molecule is placed under the control of a strong pol II or pol III promoter 
are preferred. 

An antisense nucleic acid molecule of the invention can be an a-anomeric 
nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific double- 
5 stranded hybrids with complementary RNA in which, contrary to the usual a-units, the 
strands run parallel to each other (Gaultier et aL, 1 987, Nucleic Acids Res, 1 5:6625- 
6641). The antisense nucleic acid molecule can also comprise a 2'-o- 
methylribonucleotide (Inoue etai, \9%1, Nucleic Acids Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inouee/ a/., \m, FEES Lett. 215:327-330). 

10 The invention also encompasses ribozymes. Ribozymes are catalytic 

RNA molecules with ribonuclease activity which are capable of cleaving a single- 
stranded nucleic acid, such as an mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes as described in Haselhoff and Gerlach, 
1988, Nature 334:585-591) can be used to catalyticaily cleave mRNA transcripts to 

1 5 thereby inhibit translation of the protein encoded by the mRNA. A ribozyme having 
specificity for a nucleic acid molecule encoding a marker protein can be designed based 
upon the nucleotide sequence of a cDNA corresponding to the marker. For example, a 
derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide 
sequence of the active site is complementary to the nucleotide sequence to be cleaved 

20 (see Cech et at. U.S. Patent No. 4,987,071; and Cech et al U.S. Patent No. 5,1 16,742). 
Alternatively, an mRNA encoding a polypeptide of the invention can be used to select a 
catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules 
(see, e.g., Bartel and Szostak, 1993, Science 261 :141 1-1418). 

The invention also encompasses nucleic acid molecules which form triple 

25 helical structures. For example, expression of a marker of the invention can be inhibited 
by targeting nuclisotide sequences complementary to the regulatory region of the gene 
encoding the marker nucleic acid or protein (e.g., the promoter and/or enhancer) to form 
triple helical structures that prevent transcription of the gene in target cells. See 
generally Helene {\99\) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann, K K 

30 Acad Sci. 660:27-36; and Maher (1992) Bioassays 14(12):807.15. 

In various embodiments, the nucleic acid molecules of the invention can 
be modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., 
the stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic 

35 acids (see Hyrup et al, 1996, Bioorganic & Medicinal Chemistry 4(1): 5-23). As used 
herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., 
DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
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pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength. The synthesis of PN A oligomers can be 
performed using standard solid phase peptide synthesis protocols as described in Hyrup 
5 et al (1996), supra\ Feny-O'Keefe et al (1996) Proc, Natl Acad. ScL USA 93:14670- 
675. 

PNAs can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing transcription or translation arrest or 

10 inhibiting replication. PNAs can also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PGR clamping; as artificial restriction 
enzymes when used in combination with other enzymes, e.g., SI nucleases (Hyrup 
(1996), supra\ or as probes or primers for DNA sequence and hybridization (Hyrup, 
1996, supra\ Perry-O'Keefe et al, 1996, Proc. Natl Acad. ScL USA 93:14670-675). 

1 5 In another embodiment, PNAs can be modified, e.g. , to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of 
drug delivery known in the art. For example, PNA-DNA chimeras can be generated 
which can combine the advantageous properties of PNA and DNA. Such chimeras 

20 allow DNA recognition enzymes, e.g. , RNase H and DNA polymerases, to interact with 
the DNA portion while the PNA portion would provide high binding affinity and 
specificity, PNA-DNA chimeras can be linked using linkers of appropriate lengths 
selected in terms of base stacking, number of bonds between the nucleobases, and 
orientation (Hyrup, 1996, supra). The synthesis of PNA-DNA chimeras can be 

25 performed as described in Hyrup (1996), supra, and Finn et al (1996) Nucleic Acids 
Res. 24(1 7):3357-63. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry and modified nucleoside analogs. 
Compounds such as 5 -(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite can 
be used as a link between the PNA and the 5* end of DNA (Mag et al, 1989, Nucleic 

30 Acids Res. 1 7:5973-88). PNA monomers are then coupled in a step-wise manner to 
produce a chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al, 
1996, Nucleic Acids Res, 24(17):3357-63). Alternatively, chimeric molecules can be 
synthesized with a 5' DNA segment and a 3* PNA segment (Peterser et al, 1975, 
Bioorganic Med. Chem. Lett. 5:1119-11 124). 

35 In other embodiments, the oligonucleotide can include other appended 

groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents 
facilitating transport across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. 
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Natl Acad ScL USA 86:6553-6556; Lemaitre et al, 1987, Proc. Natl Acad. ScL USA 
84:648-652; PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e,g,, 
PCT Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization-triggered cleavage agents (see, e.g., Krol et ai, 1988, Bio/Techniques 
5 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To 
this end, the oligonucleotide can be conjugated to another molecule, ^.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

The invention also includes molecular beacon nucleic acids having at 

10 least one region which is complementary to a nucleic acid of the invention, such that the 
molecular beacon is useful for quantitating the presence of the nucleic acid of the 
invention in a sample. A "molecular beacon" nucleic acid is a nucleic acid comprising a 
pair of complementary regions and having a fluorophore and a fluorescent quencher 
associated therewith. The fluorophore and quencher are associated with different 

1 5 portions of the nucleic acid in such an orientation that when the complementary regions 
are annealed with one another, fluorescence of the fluorophore is quenched by the 
quencher. When the complementary regions of the nucleic acid are not annealed with 
one another, fluorescence of the fluorophore is quenched to a lesser degree. Molecular 
beacon nucleic acids are described, for example, in U.S. Patent 5,876,930. 

20 

11. Isolated Proteins and Antibodies 

One aspect of the invention pertains to isolated marker proteins and 
biologically active portions thereof, as well as polypeptide fi-agments suitable for use as 
immunogens to raise antibodies directed against a marker protein or a fragment thereof. 

25 In one embodiment, the native marker protein can be isolated from cells or tissue 
sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, a protein or peptide comprising the whole or a 
segment of the marker protein is produced by recombinant DNA techniques. Alternative 
to recombinant expression, such protein or peptide can be synthesized chemically using 

30 standard peptide synthesis techniques. 

An "isolated" or "purified" protein or biologically active portion thereof 
is substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or substantially free of chemical 
precursors or other chemicals when chemically synthesized. The language 

35 "substantially free of cellular material" includes preparations of protein in which the 
protein is separated from cellular components of the cells from which it is isolated or 
recombinantly produced. Thus, protein that is substantially free of cellular material 
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includes preparations of protein having less than about 30%, 20%, 10%, or 5% (by dry 
weight) of heterologous protein (also referred to herein as a "contaminating protein"). 
When the protein or biologically active portion thereof is recombinantly produced, it is 
also preferably substantially free of culture medium, /.e., culture medium represents less 
5 than about 20%, 10%, or 5% of the volume of the protein preparation. When the protein 
is produced by chemical synthesis, it is preferably substantially free of chemical 
precursors or other chemicals, i.e., it is separated from chemical precursors or other 
chemicals which are involved in the synthesis of the protein. Accordingly such 
preparations of the protein have less than about 30%, 20%, 10%, 5% (by dry weight) of 

10 chemical precursors or compounds other than the polypeptide of interest. 

Biologically active portions of a marker protein include polypeptides 
comprising amino acid sequences sufficiently identical to or derived from the amino 
acid sequence of the marker protein, which include fewer amino acids than the full 
length protein, and exhibit at least one activity of the corresponding full-length protein. 

1 5 Typically, biologically active portions comprise a domain or motif with at least one 
activity of the corresponding full-length protein. A biologically active portion of a 
marker protein of the invention can be a polypeptide which is, for example, 1 0, 25, 50, 
1 00 or more amino acids in length. Moreover, other biologically active portions, in 
which other regions of the marker protein are deleted, can be prepared by recombinant 

20 techniques and evaluated for one or more of the functional activities of the native form 
of the marker protein. 

Preferred marker proteins are encoded by nucleotide sequences 
comprising the sequence of any of the even numbered SEQ ID NOs. Other useful 
proteins are substantially identical (e.g., at least about 40%, preferably 50%, 60%, 70%, 

25 80%, 90%, 95%, or 99%) to one of these sequences and retain the functional activity of 
the corresponding naturally-occurring marker protein yet differ in amino acid sequence 
due to natural allelic variation or mutagenesis. 

To determine the percent identity of two amino acid sequences or of two 
nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can 

30 be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 
compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 

35 molecules are identical at that position. The percent identity between the two sequences 
is a function of the number of identical positions shared by the sequences (i.e., % 
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identity = # of identical positions/total # of positions (e.g,, overlapping positions) xlOO). 
In one embodiment the two sequences are the same length. 

The determination of percent identity between two sequences can be 
accomplished using a mathematical algorithm. A preferred, non-limiting example of a 
5 mathematical algorithm utilized for the comparison of two sequences is. the algorithm of 
Karlin and Altschul (1990) Proc, Natl. Acad Sci. USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc, Natl. Acad Sci. USA 90:5873-5877. Such an 
algorithm is incorporated into the BLASTN and BLASTX programs of Ahschul, et al 
(1990)7. Mol Biol. 215:403-410. BLAST nucleotide searches can be performed with 

10 the BLASTN program, score = 100, wordlength = 1 2 to obtain nucleotide sequences 
homologous to a nucleic acid molecules of the invention. BLAST protein searches can 
be performed with the BLASTP program, score = 50, wordlength = 3 to obtain amino 
acid sequences homologous to a protein molecules of the invention. To obtain gapped 
alignments for comparison purposes, a newer version of the BLAST algorithm called 

15 Gapped BLAST can be utilized as described in Altschul et al (1997) Nucleic Acids Res, 
25:3389-3402, which is able to perform gapped local alignments for the programs 
BLASTN, BLASTP and BLASTX. Alternatively, PSI-Blast can be used to perform an 
iterated search which detects distant relationships between molecules. When utilizing 
BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters of the 

20 respective programs (e.g. , BLASTX and BLASTN) can be used. See 

http://www.ncbi.nlm.nih.gov. Another preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of 
Myers and Miller, (1988) CABIOSA A 1-17. Such an algorithm is incorporated into the 
ALIGN program (version 2.0) which is part of the GCG sequence alignment software 

25 package. When utilizing the ALIGN program for comparing amino acid sequences, a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be 
used. Yet another useftil algorithm for identifying regions of local sequence similarity 
and alignment is the FASTA algorithm as described in Pearson and Lipraan (1988) 
Proa Natl Acad Sci USA 85:2444-2448. When using the FASTA algorithm for 

30 comparing nucleotide or amino acid sequences, a PAM120 weight residue table can, for 
example, be used with a A-tuple value of 2. 

The percent identity between two sequences can be determined using 
techniques similar to those described above, with or without allowing gaps. In 
calculating percent identity, only exact matches are counted. 

35 The invention also provides chimeric or fusion proteins comprising a 

marker protein or a segment thereof As used herein, a "chimeric protein" or "fusion 
protein" comprises all or part (preferably a biologically active part) of a marker protein 
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operably linked to a heterologous polypeptide (i.e., a polypeptide other than the marker 
protein). Within the fusion protein, the term "operably linked" is intended to indicate 
that the marker protein or segment thereof and the heterologous polypeptide are fused 
in-frame to each other. The heterologous polypeptide can be fused to the amino- 
5 terminus or the carboxyl-terminus of the marker protein or segment. 

One useful fusion protein is a GST fusion protein in which a marker 
protein or segment is fused to the carboxyl terminus of GST sequences. Such fusion 
proteins can facilitate the purification of a recombinant polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal 

10 sequence at its amino terminus. For example, the native signal sequence of a marker 
protein can be removed and replaced with a signal sequence from another protein. For 
example, the gp67 secretory sequence of the baculovirus envelope protein can be used as 
a heterologous signal sequence (Ausubel et al , ed., Current Protocols in Molecular 
Biology, John Wiley & Sons, NY, 1992). Other examples of eukaryotic heterologous 

15 signal sequences include the secretory sequences of melittin and human placental 

alkaline phosphatase (Stratagene; La Jolla, California). In yet another example, useful 
prokaryotic heterologous signal sequences include the phoA secretory signal (Sambrook 
et al, supra) and the protein A secretory signal (Pharmacia Biotech; Piscataway, New 
Jersey). 

20 In yet another embodiment, the fusion protein is an immunoglobulin 

fusion protein in which all or part of a marker protein is fused to sequences derived from 
a member of the immunoglobulin protein family. The immunoglobulin fusion proteins 
of the invention can be incorporated into pharmaceutical compositions and administered 
to a subject to inhibit an interaction between a ligand (soluble or membrane-bound) and 

25 a protein on the surface of a cell (receptor), to thereby suppress signal transduction in 
vivo. The inununoglobulin fusion protein can be used to affect the bioavailability of a 
cognate ligand of a marker protein. Inhibition of ligand/receptor interaction can be 
useful therapeutically, both for treating proliferative and differentiative disorders and for 
modulating (e.g. promoting or inhibiting) cell survival. Moreover, the immunoglobulin 

30 fusion proteins of the invention can be used as immunogens to produce antibodies 
directed against a marker protein in a subject, to purify ligands and in screening assays 
to identify molecules which inhibit the interaction of the marker protein with ligands. 

Chimeric and fusion proteins of the invention can be produced by 
standard recombinant DNA techniques. In another embodiment, the fusion gene can be 

35 synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 
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fragments which can subsequently be annealed and re-amplified to generate a chimeric 
gene sequence (see, e.g., Ausubel et al, supra). Moreover, many expression vectors are 
commercially available that already encode a fusion moiety (e.g., a GST polypeptide). 
A nucleic acid encoding a polypeptide of the invention can be cloned into such an 
5 expression vector such that the fusion moiety is linked in-frame to the polypeptide of the 
invention. 

A signal sequence can be used to facilitate secretion and isolation of 
marker proteins. Signal sequences are typically characterized by a core of hydrophobic 
amino acids which are generally cleaved from the mature protein during secretion in one 

1 0 or more cleavage events. Such signal peptides contain processing sites that allow 
cleavage of the signal sequence from the mature proteins as they pass through the 
secretory pathway. Thus, the invention pertains to marker proteins, fusion proteins or 
segments thereof having a signal sequence, as well as to such proteins from which the 
signal sequence has been proteolytically cleaved (/.e., the cleavage products). In one 

15 embodiment, a nucleic acid sequence encoding a signal sequence can be operably linked 
in an expression vector to a protein of interest, such as a marker protein or a segment 
thereof. The signal sequence directs secretion of the protein, such as from a eukaryotic 
host into which the expression vector is transformed, and the signal sequence is 
subsequently or concurrently cleaved. The protein can then be readily purified from the 

20 extracellular medium by art recognized methods. Alternatively, the signal sequence can 
be linked to the protein of interest using a sequence which facilitates purification, such 
as with a GST domain. 

The present invention also pertains to variants of the marker proteins. 
Such variants have an altered amino acid sequence which can fimction as either agonists 

25 (mimetics) or as antagonists. Variants can be generated by mutagenesis, e.g., discrete 
point mutation or truncation. An agonist can retain substantially the same, or a subset, 
of the biological activities of the naturally occurring form of the protein. An antagonist 
of a protein can inhibit one or more of the activities of the naturally occurring form of 
the protein by, for example, competitively binding to a downstream or upstream member 

30 of a cellular signaling cascade which includes the protein of interest. Thus, specific 
biological effects can be elicited by treatment with a variant of limited function. 
Treatment of a subject with a variant having a subset of the biological activities of the 
naturally occurring form of the protein can have fewer side effects in a subject relative to 
treatment with the naturally occurring form of the protein. 

35 Variants of a marker protein which function as either agonists (mimetics) 

or as antagonists can be identified by screening combinatorial libraries of mutants, e.g., 
truncation mutants, of the protein of the invention for agonist or antagonist activity. In 
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one embodiment, a variegated library of variants is generated by combinatorial 
mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A 
variegated library of variants can be produced by, for example, enzymatically ligating a 
mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of 
5 potential protein sequences is expressible as individual polypeptides, or alternatively, as 
a set of larger fusion proteins {e.g., for phage display). There are a variety of methods 
which can be used to produce libraries of potential variants of the marker proteins from a 
degenerate oligonucleotide sequence. Methods for synthesizing degenerate 
oligonucleotides are known in the art (see, e,g,, Narang, 1983, Tetrahedron 39:3; Itakura 

10 et ai, 1984, Annu, Rev. Biochem. 53:323; Itakura et ai, 1984, Science 198:1056; Ike et 
aL 1 983 Nucleic Acid Res. 1 1 :477). 

In addition, libraries of segments of a marker protein can be used to 
generate a variegated population of polypeptides for screening and subsequent selection 
of variant marker proteins or segments thereof. For example, a library of coding 

1 5 sequence fragments can be generated by treating a double stranded PCR fragment of the 
coding sequence of interest with a nuclease under conditions wherein nicking occurs 
only about once per molecule, denaturing the double stranded DNA, renaturing the DNA 
to form double stranded DNA which can include sense/antisense pairs from different 
nicked products, removing single stranded portions from reformed duplexes by 

20 treatment with SI nuclease, and ligating the resuhing fragment library into an expression 
vector. By this method, an expression library can be derived which encodes amino 
terminal and internal fragments of various sizes of the protein of interest. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 

25 libraries for gene products having a selected property. The most widely used techniques, 
which are amenable to high through-put analysis, for screening large gene libraries 
typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection of a desired activity facilitates 

30 isolation of the vector encoding the gene whose product was detected. Recursive 

ensemble mutagenesis (REM), a technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with the screening assays to identify 
variants of a protein of the invention (Arkin and Yourvan, 1992, Proc. Natl Acad. Sci. 
USA 59:781 1-7815; Delgrave et ai, 1993, Protein Engineering 6(3):327. 331). 

35 Another aspect of the invention pertains to antibodies directed against a 

protein of the invention. In preferred embodiments, the antibodies specifically bind a 
marker protein or a fragment thereof. The terms "antibody" and "antibodies" as used 
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interchangeably herein refer to inununoglobulin molecules as well as fragments and 
derivatives thereof that comprise an immunologically active portion of an 
immunoglobulin molecule, (/.e., such a portion contains an antigen binding site which 
specifically binds an antigen, such as a marker protein, e.g., an epitope of a marker 
5 protein). An antibody which specifically binds to a protein of the invention is an 
antibody which binds the protein, but does not substantially bind other molecules in a 
sample, e.g., a biological sample, which naturally contains the protein. Examples of an 
immunologically active portion of an immunoglobulin molecule include, but are not 
limited to, single-chain antibodies (scAb), F(ab) and F(ab*)2 fragments. 

10 An isolated protein of the invention or a fragment thereof can be used as 

an immunogen to generate antibodies. The full-length protein can be used or, 
alternatively, the invention provides antigenic peptide fragments for use as immunogens. 
The antigenic peptide of a protein of the invention comprises at least 8 (preferably 1 0, 
15, 20, or 30 or more) amino acid residues of the amino acid sequence of one of the 

15 proteins of the invention, and encompasses at least one epitope of the protein such that 
an antibody raised against the peptide forms a specific immune complex with the 
protein. Preferred epitopes encompassed by the antigenic peptide are regions that are 
located on the surface of the protein, e.g., hydrophilic regions. Hydrophobicity sequence 
analysis, hydrophilicity sequence analysis, or similar analyses can be used to identify 

20 hydrophilic regions. In preferred embodiments, an isolated marker protein or firagment 
thereof is used as an immunogen. 

An immunogen typically is used to prepare antibodies by immunizing a 
suitable {i.e. immunocompetent) subject such as a rabbit, goat, mouse, or other mammal 
or vertebrate. An appropriate immunogenic preparation can contain, for example, 

25 recombinantly-expressed or chemically-synthesized protein or peptide. The preparation 
can further include an adjuvant, such as Freund*s complete or incomplete adjuvant, or a 
similar immunostimulatory agent. Preferred immunogen compositions are those that 
contain no other human proteins such as, for example, immunogen compositions made 
using a non-human host cell for recombinant expression of a protein of the invention. In 

30 such a manner, the resulting antibody compositions have reduced or no binding of 
human proteins other than a protein of the invention. 

The invention provides polyclonal and monoclonal antibodies. The term 
"monoclonal antibody" or "monoclonal antibody composition", as used herein, refers to 
a population of antibody molecules that contain only one species of an antigen binding 

35 site capable of immunoreacting with a particular epitope. Preferred polyclonal and 
monoclonal antibody compositions are ones that have been selected for antibodies 
directed against a protein of the invention. Particularly preferred polyclonal and 
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monoclonal antibody preparations are ones that contain only antibodies directed against 
a marker protein or fragment thereof. 

Polyclonal antibodies can be prepared by immunizing a suitable subject 
with a protein of the invention as an immunogen The antibody titer in the immunized 
5 subject can be monitored over time by standard techniques, such as with an enzyme 
linked immunosorbent assay (ELISA) using immobilized polypeptide. At an appropriate 
time after immunization, e.g., when the specific antibody titers are highest, antibody- 
producing cells can be obtained from the subject and used to prepare monoclonal 
antibodies (mAb) by standard techniques, such as the hybridoma technique originally 

10 described by Kohler and Milstein (1975) Nature 256:495-497, the human B cell 
hybridoma technique (see Kozbor era/., \9S3, Immunol. TbJay 4:72), the EBV- 
hybridoma technique (see Cole et al^ pp. 77-96 In Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., 1985) or trioma techniques. The technology for producing 
hybridomas is well known (see generally Current Protocols in Immunology, Coligan et 

15 al ed., John Wiley & Sons, New York, 1994). Hybridoma cells producing a 

monoclonal antibody of the invention are detected by screening the hybridoma culture 
supematants for antibodies that bind the polypeptide of interest, e,g,, using a standard 
ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 

20 monoclonal antibody directed against a protein of the invention can be identified and 
isolated by screening a recombinant combinatorial immunoglobulin library {e.g., an 
antibody phage display library) with the polypeptide of interest. Kits for generating and 
screening phage display libraries are commercially available (e.g., the Pharmacia 
Recombinant Phage Antibody System, Catalog No. 27-9400-01 ; and the Stratagene 

25 SurfZAP Phage Display Kit, Catalog No. 240612). Additionally, examples of methods 
and reagents particularly amenable for use in generating and screening antibody display 
library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No. 
WO 92/18619; PCT Publication No. WO 91/17271; PCT Publication No, WO 92/20791; 
PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; PCT 

30 Publication No. WO 92/01 047; PCT Publication No. WO 92/09690; PCT Publication 
No. WO 90/02809; Fuchs et ai (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) 
Hum. Antibod. Hybridomas 3:81-85; Huse etal. (1989) Science 246:1275- 1281; 
Griffiths e/^/. (1993) EMBO J. 12:725-734. 

The invention also provides recombinant antibodies that specifically bind 

35 a protein of the invention. In preferred embodiments, the recombinant antibodies 
specifically binds a marker protein or fi-agment thereof. Recombinant antibodies 
include, but are not limited to, chimeric and humanized monoclonal antibodies. 
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comprising both human and non-human portions, single-chain antibodies and multi- 
specific antibodies. A chimeric antibody is a molecule in which different portions are 
derived fi-om different animal species, such as those having a variable region derived 
from a murine mAb and a human immunoglobulin constant region. (See, e.g., Cabilly et 
5 al., U.S. Patent No. 4,816,567; and Boss et a!., U.S. Patent No. 4,816,397, which are 
incorporated herein by reference in their entirety.) Single-chain antibodies have an 
antigen binding site and consist of single polypeptides. They can be produced by 
techniques known in the art, for example using methods described in Ladner et al U.S. 
Pat. No. 4,946,778 (which is incorporated herein by reference in its entirety); Bird et al, 

10 (1988) Science 242:423-426; Whitlow e/ a/., (1991) Methods in Enzymology 2:1-9; 
Whitlow et ai, (199\) Methods in Enzymology 2:97-105; and Huston et al., (1991) 
Methods in Enzymology Molecular Design and Modeling: Concepts and Applications 
203:46-88. Multi-specific antibodies are antibody molecules having at least two 
antigen-binding sites that specifically bind different antigens. Such molecules can be 

15 produced by techniques known in the art, for example using methods described in Segal, 
U.S. Patent No. 4,676,980 (the disclosure of which is incorporated herein by reference 
in its entirety); HoUiger et al., (1993) Proc. Natl. Acad Sci. USA 90:6444-6448; Whitlow 
et al., (1994) Protein Eng. 7:1017-1026 and U.S. Pat. No. 6,121 ,424. 

Humanized antibodies are antibody molecules from non-human species 

20 having one or more complementarity determining regions (CDRs) from the non-human 
species and a framework region from a human inmiunoglobulin molecule. (See, e.g., 
Queen, U.S. Patent No. 5,585,089, which is incorporated herein by reference in its 
entirety.) Humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in PCT Publication 

25 No. WO 87/0267 1 ; European Patent Application 1 84, 1 87; European Patent Application 
171,496; European Patent Application 173,494; PCT Publication No. WO 86/01533; 
U.S. Patent No. 4,8 16,567; European Patent Application 125,023; Better et al (1988) 
Science 240:1041-1043; Liu et al (1987) Proc, Natl Acad Sci. USA 84:3439-3443; Liu 
et al (1987) J. Immunol 139:3521- 3526; Sun et al (1987) Proc. Natl Acad ScL USA 

30 84:214-218; Nishimura et al (1987) Cancer Res. 47:999-1005; Wood etal (1985) 
Ato/wr^ 314:446-449; and Shaw era/. {\98S)J.Natl Cancer Inst. 80:1553-1559); 
Morrison (1985) Science 229:1202-1207; Oi et al (1986) Bio/Techniques 4:214; U.S. 
Patent 5,225,539; Jones et al (1986) Nature 321:552-525; Verhoeyan et al (1988) 
Science 239:1534; and Beidler et al (1988) J. Immunol 141 :4053'4060. 

35 More particularly, humanized antibodies can be produced, for example, 

using transgenic mice which are incapable of expressing endogenous immunoglobulin 
heavy and light chains genes, but which can express human heavy and light chain genes. 
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The transgenic mice are immunized in the normal fashion with a selected antigen, e.g., 
all or a portion of a polypeptide corresponding to a marker of the invention. Monoclonal 
antibodies directed against the antigen can be obtained using conventional hybridoma 
technology. The human immunoglobulin transgenes harbored by the transgenic mice 
5 rearrange during B cell differentiation, and subsequently undergo class switching and 
somatic mutation. Thus, using such a technique, it is possible to produce therapeutically 
useful IgG, IgA and IgE antibodies. For an overview of this technology for producing 
human antibodies, see Lonberg and Huszar (1995) Int, Rev, Immunol 13:65-93). For a 
detailed discussion of this technology for producing human antibodies and human 

10 monoclonal antibodies and protocols for producing such antibodies, see, e.g., U.S. 

Patent 5,625,126; U.S. Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 5,661,016; 
and U.S. Patent 5,545,806. In addition, companies such as Abgenix, Inc. (Freemont, 
CA), can be engaged to provide human antibodies directed against a selected antigen 
using technology similar to that described above. 

15 Completely human antibodies which recognize a selected epitope can be 

generated using a technique referred to as "guided selection." In this approach a selected 
non-human monoclonal antibody, e.g., a murine antibody, is used to guide the selection 
of a completely human antibody recognizing the same epitope (Jespers et al, 1994, 
Bio/technology 12:899-903). 

20 The antibodies of the invention can be isolated after production {e.g. , 

from the blood or serum of the subject) or synthesis and further purified by well-known 
techniques. For example, IgG antibodies can be purified using protein A 
chromatography. Antibodies specific for a protein of the invention can be selected or 
{e.g., partially purified) or purified by, e.g., affinity chromatography. For example, a 

25 recombinantly expressed and purified (or partially purified) protein of the invention is 
produced as described herein, and covalently or non-covalently coupled to a solid 
support such as, for example, a chromatography colunm. The column can then be used 
to affinity purify antibodies specific for the proteins of the invention from a sample 
containing antibodies directed against a large number of different epitopes, thereby 

30 generating a substantially purified antibody composition, i.e., one that is substantially 
free of contaminating antibodies. By a substantially purified antibody composition is 
meant, in this context, that the antibody sample contains at most only 30% (by dry 
weight) of contaminating antibodies directed against epitopes other than those of the 
desired protein of the invention, and preferably at most 20%, yet more preferably at 

35 most 10%, and most preferably at most 5% (by dry weight) of the sample is 

contaminating antibodies. A purified antibody composition means that at least 99% of 
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the antibodies in the composition are directed against the desired protein of the 
invention. 

In a preferred embodiment, the substantially purified antibodies of the 
invention may specifically bind to a signal peptide, a secreted sequence, an extracellular 
5 domain, a transmembrane or a cytoplasmic domain or cytoplasmic membrane of a 
protein of the invention. In a particularly preferred embodiment, the substantially 
purified antibodies of the invention specifically bind to a secreted sequence or an 
extracellular domain of the amino acid sequences of a protein of the invention. In a 
more preferred embodiment, the substantially purified antibodies of the invention 

10 specifically bind to a secreted sequence or an extracellular domain of the amino acid 
sequences of a marker protein. 

An antibody directed against a protein of the invention can be used to 
isolate the protein by standard techniques, such as affinity chromatography or 
immunoprecipitation. Moreover, such an antibody can be used to detect the marker 

15 protein or fi-agment thereof (e.g., in a cellular lysate or cell supernatant) in order to 
evaluate the level and pattern of expression of the marker. The antibodies can also be 
used diagnostically to monitor protein levels in tissues or body fluids (e.g. in a breast- or 
ovary-associated body fluid) as part of a clinical testing procedure, e.g., to, for example, 
determine the efficacy of a given treatment regimen. Detection can be facilitated by the 

20 use of an antibody derivative, which comprises an antibody of the invention coupled to a 
detectable substance. Examples of detectable substances include various enzymes, 
prosthetic groups, fluorescent materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, p-galactosidase, or acetylcholinesterase; examples of 

25 suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; 
examples of suitable fluorescent materials include umbelliferone, fluorescein, 
fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl 
chloride or phycoerythrin; an example of a luminescent material includes luminol; 
examples of bioluminescent materials include luciferase, luciferin, and aequorin, and 

30 examples of suitable radioactive material include ^ ^I, ^^S or ^H. 

Antibodies of the invention may also be used as therapeutic agents in 
treating cancers. In a preferred embodiment, completely human antibodies of the 
invention are used for therapeutic treatment of human cancer patients, particularly those 
having breast or ovarian cancer. In another preferred embodiment, antibodies that bind 

35 specifically to a marker protein or fi-agment thereof are used for therapeutic treatment. 
Further, such therapeutic antibody may be an antibody derivative or immunotoxin 
comprising an antibody conjugated to a therapeutic moiety such as a cytotoxin, a 
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therapeutic agent or a radioactive metal ion. A cytotoxin or cj^otoxic agent includes any 
agent that is detrimental to cells. Examples include taxol, cytochalasin B, gramicidin 
ethidium bromide, emetine, mitomycin, etoposide, tenoposide, vincristine, vinblastine, 
colchicin, doxorubicin, daunorubicin, dihydroxy anthracin dione, mitoxantrone, 
5 mithramycin, actinomycin D, 1-dehydrotestosterone, glucocorticoids, procaine, 
tetracaine, lidocaine, propranolol, and puromycin and analogs or homologs thereof 
Therapeutic agents include, but are not limited to, antimetabolites (e.g., methotrexate, 
6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating 
agents (e.g., mechlorethamine, thioepa chlorambucil, melphalan, carmustine (BSNU) 

10 and lomustine (CCNU), cyclothosphamide, busulfan, dibromomannitol, streptozotocin, 
mitomycin C, and cis-dichlorodiamine platinum (II) (DDP) cisplatin), anthracyclines 
(e.g., daunorubicin (formerly daunomycin) and doxorubicin), antibiotics (e.g., 
dactinomycin (formerly actinomycin), bleomycin, mithramycin, and anthramycin 
(AMC)), and anti-mitotic agents (e.g., vincristine and vinblastine). 

15 The conjugated antibodies of the invention can be used for modifying a 

given biological response, for the drug moiety is not to be construed as limited to 
classical chemical therapeutic agents. For example, the drug moiety may be a protein or 
polypeptide possessing a desired biological activity. Such proteins may include, for 
example, a toxin such as ribosome-inhibiting protein (see Better et al., U.S. Patent No. 

20 6,146,63 1 , the disclosure of which is incorporated herein in its entirety), abrin, ricin A, 
pseudomonas exotoxin, or diphtheria toxin; a protein such as tumor necrosis factor, 
.alpha.-interferon, .beta.-interferon, nerve growth factor, platelet derived growth factor, 
tissue plasminogen activator; or, biological response modifiers such as, for example, 
lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL.2"), interleukin-6 (**IL-6"), 

25 granulocyte macrophase colony stimulating factor ("GM-CSF"), granulocyte colony 
stimulating factor ("G-CSF"), or other growth factors. 

Techniques for conjugating such therapeutic moiety to antibodies are well 
known, see, e.g., Amon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs 
In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. 

30 (eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug 
Delivery", in Controlled Drug Delivery(2nd Ed.), Robinson et al. (eds.), pp. 623-53 
(Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer 
Therapy: A Review", in Monoclonal Antibodies '84: Biological And Clinical 
Applications, Pinchera et al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future 

35 Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in 
Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp. 
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303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic 
Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62:1 19-58 (1982). 

Accordingly, in one aspect, the invention provides substantially purified 
antibodies, antibody fragments and derivatives, all of which specifically bind to a 
5 protein of the invention and preferably, a marker protein. In various embodiments, the 
substantially purified antibodies of the invention, or fragments or derivatives thereof, 
can be human, non-human, chimeric and/or humanized antibodies. In another aspect, 
the invention provides non-human antibodies, antibody fragments and derivatives, all of 
which specifically bind to a protein of the invention and preferably, a marker protein. 

10 Such non-human antibodies can be goat, mouse, sheep, horse, chicken, rabbit, or rat 
antibodies. Alternatively, the non-human antibodies of the invention can be chimeric 
and/or humanized antibodies. In addition, the non-human antibodies of the invention 
can be polyclonal antibodies or monoclonal antibodies. In still a further aspect, the 
invention provides monoclonal antibodies, antibody fragments and derivatives, all of 

15 which specifically bind to a protein of the invention and preferably, a marker protein. 
The monoclonal antibodies can be human, humanized, chimeric and/or non-human 
antibodies. 

The invention also provides a kit containing an antibody of the invention 
conjugated to a detectable substance, and instructions for use. Still another aspect of the 
20 invention is a pharmaceutical composition comprising an antibody of the invention. In 
one embodiment, the pharmaceutical composition comprises an antibody of the 
invention, a therapeutic moiety, and a pharmaceutically acceptable carrier. 

III. Recombinant Expression Vectors and Host Cells 

25 Another aspect of the invention pertains to vectors, preferably expression 

vectors, containing a nucleic acid encoding a marker protein (or a portion of such a 
protein). As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of vector is a 
"plasmid", which refers to a circular double stranded DNA loop into which additional 

30 DNA segments can be ligated. Another type of vector is a viral vector, wherein 
additional DNA segments can be ligated into the viral genome. Certain vectors are 
capable of autonomous replication in a host cell into which they are introduced (e.g., 
bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 

35 genome of a host cell upon introduction into the host cell, and thereby are replicated 
along with the host genome. Moreover, certain vectors, namely expression vectors, are 
capable of directing the expression of genes to which they are operably linked. In 
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general, expression vectors of utility in recombinant DNA techniques are often in the 
form of plasmids (vectors). However, the invention is intended to include such other 
forms of expression vectors, such as viral vectors (e.g., replication defective 
retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent 
5 functions. 

The recombinant expression vectors of the invention comprise a nucleic 
acid of the invention in a form suitable for expression of the nucleic acid in a host cell. 
This means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which is 

10 operably linked to the nucleic acid sequence to be expressed. Within a recombinant 

expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 

15 sequence" is intended to include promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel, Methods in Enzymology: Gene Expression Technology vol.185. 
Academic Press, San Diego, CA (1991). Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in many types of host cell and 

20 those which direct expression of the nucleotide sequence only in certain host cells {e.g. , 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, the level of expression of protein desired, and the like. The 
expression vectors of the invention can be introduced into host cells to thereby produce 

25 proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein. 

The recombinant expression vectors of the invention can be designed for 
expression of a marker protein or a segment thereof in prokaryotic {e.g., E. coli) or 
eukaryotic cells (e.g., insect cells {using baculovirus expression vectors}, yeast cells or 

30 mammalian cells). Suitable host cells are discussed further in Goeddel, supra. 

Alternatively, the recombinant expression vector can be transcribed and translated in 
vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli 
with vectors containing constitutive or inducible promoters directing the expression of 

35 either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a 
protein encoded therein, usually to the amino terminus of the recombinant protein. Such 
fusion vectors typically serve three purposes: 1) to increase expression of recombinant 
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protein; 2) to increase the solubility of the recombinant protein; and 3) to aid in the 
purification of the recombinant protein by acting as a ligand in affmity purification. 
Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the 
junction of the fusion moiety and the recombinant protein to enable separation of the 
5 recombinant protein from the fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, 
thrombin and enterokinase. Typical fusion expression vectors include pGEX 
(Pharmacia Biotech Inc; Smith and Johnson, 1988, Gene 67:31-40), pMAL (New 
England Biolabs, Beveriy, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse 

1 0 glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to 
the target recombinant protein. 

Examples of suitable inducible non-fusion E, coli expression vectors 
include pTrc (Amann et ai, 1988, Gene 69:301-315) and pET lid (Studier et al, p. 60- 
89, In Gene Expression Technology: Methods in Enzymology vol.185. Academic Press, 

15 San Diego, CA, 1991). Target gene expression from the pTrc vector relies on host RNA 
polymerase transcription from a hybrid trp-lac fusion promoter. Target gene expression 
from the pET 1 Id vector relies on transcription from a T7 gnlO-lac. fusion promoter 
mediated by a co-expressed viral RNA polymerase (T7 gnl). This viral polymerase is 
supplied by host strains BL21(DE3) or HMS174(DE3) from a resident prophage 

20 harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is to 
express the protein in a host bacteria with an impaired capacity to proteoiytically cleave 
the recombinant protein (Gottesman, p. 1 19-128, In Gene Expression Technology: 
Methods in Enzymology vol. 185, Academic Press, San Diego, CA, 1990. Another 

25 strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
preferentially utilized in £. coli (Wada et al, 1992, Nucleic Acids Res. 20:21 11-21 18). 
Such alteration of nucleic acid sequences of the invention can be carried out by standard 
DNA synthesis techniques. 

30 In another embodiment, the expression vector is a yeast expression 

vector. Examples of vectors for expression in yeast 5. cerevisiae include pYepSecl 
(BaldarieM/., 1987, EMBOl 6:229-234), pMFa(Kurjan and Herskowitz, 1982, Ce// 
30:933-943), pJRY88 (Schultz et al, 1987, Gene 54:1 13-123), pYES2 (Invitrogen 
Corporation, San Diego, C A), and pPicZ (Invitrogen Corp, San Diego, CA). 



35 
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Alternatively, the expression vector is a baculovirus expression vector 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., Sf 
9 cells) include the pAc series (Smith et al, 1983, Mol Cell Biol 3:2156-2165) and the 
pVL series (Lucklow and Summers, 1989, Virology 170:31-39). 
5 In yet another embodiment, a nucleic acid of the invention is expressed in 

manmialian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1987, Nature 329:840) and pMT2PC 
(Kaufman e/t?/., mi.EMBOJ. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 

10 For example, commonly used promoters are derived from polyoma. Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 1 7 of Sambrook et al , supra. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 

15 (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al, 
1987, Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton, 1988, 
Adv. Immunol 43:235-275), in particular promoters of T cell receptors (Winoto and 

20 Baltimore, 1 989, EMBO J. 8:729-733) and immunoglobulins (Banerji et al, 1 983, Cell 
33:729-740; Queen and Baltimore, 1983, Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byme and Ruddle, 1989, Proc. Natl Acad. Sci USA 
86:5473-5477), pancreas-specific promoters (Edlund et al, 1985, Science 230:912-916), 
and mammary gland-specific promoters {e.g., milk whey promoter; U.S. Patent No. 

25 4,873,316 and European Application Publication No. 264, 1 66). Developmentally- 
regulated promoters are also encompassed, for example the murine hox promoters 
(Kessel and Gmss, 1990, Science 249:374-379) and the a-fetoprotein promoter (Camper 
and Tilghman, 1989, Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector 

30 comprising a DNA molecule of the invention cloned into the expression vector in an 
antisense orientation. That is, the DNA molecule is operably linked to a regulatory 
sequence in a manner which allows for expression (by transcription of the DNA 
molecule) of an RNA molecule which is antisense to the mRNA encoding a polypeptide 
of the invention. Regulatory sequences operably linked to a nucleic acid cloned in the 

35 antisense orientation can be chosen which direct the continuous expression of the 

antisense RNA molecule in a variety of cell types, for instance viral promoters and/or 
enhancers, or regulatory sequences can be chosen which direct constitutive, tissue- 
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Specific or cell type specific expression of antisense RNA. The antisense expression 
vector can be in the form of a recombinant plasmid, phagemid, or attenuated virus in 
which antisense nucleic acids are produced under the control of a high efficiency 
regulatory region, the activity of which can be determined by the cell type into which the 
5 vector is introduced. For a discussion of the regulation of gene expression using 
antisense genes see Weintraub et ai, 1986, Trends in Genetics, VoL 1(1). 

Another aspect of the invention pertains to host cells into which a 
recombinant expression vector of the invention has been introduced. The terms "host 
cell" and "recombinant host cell" are used interchangeably herein. It is understood that 
10 such terms refer not only to the particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, 
in fact, be identical to the parent cell, but are still included within the scope of the term 
as used herein. 

15 A host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., 

insect cells, yeast or mammalian cells). 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 

20 techniques for introducing foreign nucleic acid into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation. Suitable methods for transforming or transfecting host 
cells can be found in Sambrook, et al (supra), and other laboratory manuals. 

For stable transfection of manunalian cells, it is known that, depending 

25 upon the expression vector and transfection technique used, only a small fi-action of cells 
may integrate die foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 

30 hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid 
can be identified by drug selection (e.g., cells that have incorporated the selectable 
marker will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell 
in culture, can be used to produce a marker protein or a segment thereof. Accordingly, 

35 the invention fiirther provides methods for producing a marker protein or a segment 
thereof using the host cells of the invention. In one embodiment, the method comprises 
culturing the host cell of the invention (into which a recombinant expression vector 
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encoding a marker protein or a segment thereof has been introduced) in a suitable 
medium such that the is produced. In another embodiment, the method further 
comprises isolating the a marker protein or a segment thereof from the medium or the 
host cell. 

S The host cells of the invention can also be used to produce nonhuman 

transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which a sequences encoding a marker 
protein or a segment thereof have been introduced. Such host cells can then be used to 
create non-human transgenic animals in which exogenous sequences encoding a marker 

10 protein of the invention have been introduced into their genome or homologous 

recombinant animals in which endogenous gene(s) encoding a marker protein have been 
altered. Such animals are useful for studying the function and/or activity of the marker 
protein and for identifying and/or evaluating modulators of marker protein. As used 
herein, a "transgenic animal" is a non-human animal, preferably a mammal, more 

15 preferably a rodent such as a rat or mouse, in which one or more of the cells of the 

animal includes a transgene. Other examples of transgenic animals include non-human 
primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous 
DNA which is integrated into the genome of a cell from which a transgenic animal 
develops and which remains in the genome of the mature animal, thereby directing the 

20 expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous recombinant animal" is a non- 
human animal, preferably a mammal, more preferably a mouse, in which an endogenous 
gene has beeii altered by homologous recombination between the endogenous gene and 
an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic 

25 cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing a 
nucleic acid encoding a marker protein into the male pronuclei of a fertilized oocyte, 
e.g., by microinjection, retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. Intronic sequences and polyadenylation signals 

30 can also be included in the transgene to increase the efficiency of expression of the 
transgene. A tissue-specific regulatory sequence(s) can be operably linked to the 
transgene to direct expression of the polypeptide of the invention to particular cells. 
Methods for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are 

35 described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. Patent No. 
4,873,191 and in Hogan, Manipulating the Mouse Embryo, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986. Similar methods are used for 
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production of other transgenic animals. A transgenic founder animal can be identified 
based upon the presence of the transgene in its genome and/or expression of mRNA 
encoding the transgene in tissues or cells of the animals. A transgenic founder animal 
can then be used to breed additional animals carrying the transgene. Moreover, 
5 transgenic animals carrying the transgene can further be bred to other transgenic animals 
carrying other transgenes. 

To create an homologous recombinant animal, a vector is prepared which 
contains at least a portion of a gene encoding a marker protein into which a deletion, 
addition or substitution has been introduced to thereby alter, e.g., functionally disrupt, 

10 the gene. In a preferred embodiment, the vector is designed such that, upon homologous 
recombination, the endogenous gene is functionally disrupted no longer encodes a 
functional protein; also referred to as a "knock out" vector). Alternatively, the vector 
can be designed such that, upon homologous recombination, the endogenous gene is 
mutated or otherwise altered but still encodes functional protein (e.g., the upstream 

15 regulatory region can be altered to thereby alter the expression of the endogenous 
protein). In the homologous recombination vector, the altered portion of the gene is 
flanked at its 5' and 3' ends by additional nucleic acid of the gene to allow for 
homologous recombination to occur between the exogenous gene carried by the vector 
and an endogenous gene in an embryonic stem cell. The additional flanking nucleic acid 

20 sequences are of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5* and 3' 
ends) aire included in the vector (see, e.g., Thomas and Capecchi, 1 987, Cell 5 1 :503 for a 
description of homologous recombination vectors). The vector is introduced into an 
embryonic stem cell line (e.g., by electroporation) and cells in which the introduced 

25 gene has homologously recombined with the endogenous gene are selected (see, e.g., Li 
et ai, 1 992, Cell 69:91 5). The selected cells are then injected into a blastocyst of an 
animal (e.g., a mouse) to form aggregation chimeras (see, e.g., Bradley, 
Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, Robertson, Edi, 
IRL, Oxford, 1987, pp. 1 1 3-1 52). A chimeric embryo can then be implanted into a 

30 suitable pseudopregnant female foster animal and the embryo brought to term. Progeny 
harboring the homologously recombined DNA in their germ cells can be used to breed 
animals in which all cells of the animal contain the homologously recombined DNA by 
germline transmission of the transgene. Methods for constructing homologous 
recombination vectors and homologous recombinant animals are described further in 

35 Bradley ( 1 99 1 ) Current Opinion in Bio/Technology 2 :823-829 and in PCT Publication 
NOS. WO 90/1 1354, WO 91/01 140, WO 92/0968, and WO 93/04169. 
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In another embodiment, transgenic non-human animals can be produced 
which contain selected systems which allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase system of bacteriophage PI. 
For a description of the cre/loxP recombinase system, see, e.g., Lakso et al (1992) Proc, 
5 Natl Acad, ScL USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al, 1991 , Science 
251:1351-1355). If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be provided through the construction of 
10 "double" transgenic animals, eg., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut et al (1997) Nature 385:810- 
15 8 1 3 and PCT Publication NOS. WO 97/07668 and WO 97/07669. 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to 
herein as "active compounds") of the invention can be incorporated into pharmaceutical 

20 compositions suitable for administration. Such compositions typically comprise the 
nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. 
As used herein the language "pharmaceutically acceptable carrier" is intended to include 
any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, compatible with pharmaceutical 

25 administration. The use of such media and agents for pharmaceutically active 

substances is well known in the art. Except insofar as any conventional media or agent 
is incompatible with the active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 

30 The invention includes methods for preparing pharmaceutical 

compositions for modulating the expression or activity of a marker nucleic acid or 
protein . Such methods comprise formulating a pharmaceutically acceptable carrier with 
an agent which modulates expression or activity of a marker nucleic acid or protein. 
Such compositions can further include additional active agents. Thus, the invention 

35 further includes methods for preparing a pharmaceutical composition by formulating a 
pharmaceutically acceptable carrier with an agent which modulates expression or 
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activity of a marker nucleic acid or protein and one or more additional active 
compounds. 

The invention also provides methods (also referred to herein as 
"screening assays") for identifying modulators, i.e., candidate or test compounds or 
5 agents (e.g., peptides, peptidomimetics, peptoids, small molecules or other drugs) which 
(a) bind to the marker, or (b) have a modulatory (e.g., stimulatory or inhibitory) effect on 
the activity of the marker or, more specifically, (c) have a modulatory effect on the 
interactions of the marker with one or more of its natural substrates {e.g., peptide, 
protein, hormone, co-factor, or nucleic acid), or (d) have a modulatory effect on the 
1 0 expression of the marker. Such assays typically comprise a reaction between the marker 
and one or more assay components. The other components may be either the test 
compound itself, or a combination of test compound and a natural binding partner of the 
marker. 

The test compounds of the present invention may be obtained from any available 

15 source, including systematic libraries of natural and/or synthetic compounds. Test 

compounds may also be obtained by any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; peptoid libraries 
(libraries of molecules having the functionalities of peptides, but with a novel, non- 
peptide backbone which are resistant to enzymatic degradation but which nevertheless 

20 remain bioactive; see, e.g., Zuckermann et ai, 1994, ./. Med. Chem. 37:2678-85); 
spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; the 'one-bead one-compound' library method; and 
synthetic library methods using affinity chromatography selection. The biological 
library and peptoid library approaches are limited to peptide libraries, while the other 

25 four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds (Lam, 1997, Anticancer Drug Des. 12: 145). 

Examples of methods for the synthesis of molecular libraries can be 
found in the art, for example in: DeWitt et al (1993) Proc, Natl Acad. Sci. U.S.A. 
90:6909; Erb et al (1994) Proc. Natl Acad. Sci USA 91 :1 1422; Zuckermann et al 

30 (1994). J, Med Chem. 37:2678; Cho e/ a/. (1993) 5dewe 261:1303; Carrell e/t?/. 

{\99A) Angew. Chem. Int. Ed Engl 33:2059; Carell etal (199A) Angew. Chem. Int. Ed 
Engl 33:2061; and in Gallop etal (1994) J. Med Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten, 
1992, Biotechniques 13:412-421), or on beads (Lam, \99\, Nature 354:82-84), chips 

35 (Fodor, 1993, Nature 364:555-556), bacteria and/or spores, (Ladner, USP 5,223,409), 
plasmids (Cull et al 1 992, Proc Natl Acad Sci USA 89: 1 865- 1 869) or on phage (Scott 
and Smith, 1990, Science 249:386-390; Devlin, 1990, Science 249:404-406; Cwirla et 
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a/, 1990, Proc, Nail Acad ScL 87:6378-6382; Felici, 1991,7. Mol Biol 222:301-310; 
Ladner, 5i/j7ra.). 

In one embodiment, the invention provides assays for screening candidate 
or test compounds w^hich are substrates of a protein encoded by or corresponding to a 
5 marker or biologically active portion thereof. In another embodiment, the invention 
provides assays for screening candidate or test compounds which bind to a protein 
encoded by or corresponding to a marker or biologically active portion thereof 
Determining the ability of the test compound to directly bind to a protein can be 
accomplished, for example, by coupling the compound v^ith a radioisotope or enzymatic 

10 label such that binding of the compound to the marker can be determined by detecting 
the labeled marker compound in a complex. For example, compounds (e.g.\ marker 
substrates) can be labeled with '^^I, "^^S, ^"^C, or ^H, either directly or indirectly, and the 
radioisotope detected by direct counting of radioemission or by scintillation counting. 
Alternatively, assay components can be enzymatically labeled with, for example, 

15 horseradish peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label 
detected by determination of conversion of an appropriate substrate to product. 

In another embodiment, the invention provides assays for screening 
candidate or test compounds which modulate the expression of a marker or the activity 
of a protein encoded by or corresponding to a marker, or a biologically active portion 

20 thereof. In all likelihood, the protein encoded by or corresponding to the marker can, in 
vivo, interact with one or more molecules, such as but not limited to, peptides, proteins, 
hormones, cofactors and nucleic acids. For the purposes of this discussion, such cellular 
and extracellular molecules are referred to herein as "binding partners" or marker 
"substrate". 

25 One necessary embodiment of the invention in order to facilitate such 

screening is the use of a protein encoded by or corresponding to marker to identify the 
protein's natural in vivo binding partners. There are many ways to accomplish this which 
are known to one skilled in the art. One example is the use of the marker protein as "bait 
protein" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 

30 5,283,3 1 7; Zervos et al, 1 993, Cell 72:223-232; Madura et al 1 993, J. Biol Chem. 
268:12046-12054; Bartel etal ,1993, Biotechniques 14:920-924; Iwabuchi et al, 1993 
Oncogene 8:1693-1696; Brent WO94/10300) in order to identify other proteins which 
bind to or interact with the marker (binding partners) and, therefore, are possibly 
involved in the natural function of the marker. Such marker binding partners are also 

35 likely to be involved in the propagation of signals by the marker protein or dovmstream 
elements of a marker protein-mediated signaling pathway. Alternatively, such marker 
protein binding partners may also be found to be inhibitors of the marker protein. 
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The two-hybrid system is based on the modular nature of most 
transcription factors, which consist of separable DNA-binding and activation domains. 
Briefly, the assay utilizes two different DNA constructs. In one construct, the gene that 
encodes a marker protein fused to a gene encoding the DNA binding domain of a known 
5 transcription factor {e.g., GAL-4). In the other construct, a DNA sequence, from a 
library of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 
fused to a gene that codes for the activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, in vivo, forming a marker- 
dependent complex, the DNA-binding and activation domains of the transcription factor 

10 are brought into close proximity. This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene can be readily detected and cell 
colonies containing the functional transcription factor can be isolated and used to obtain 
the cloned gene which encodes the protein which interacts. with the marker protein. 

15 In a further embodiment, assays may be devised through the use of the 

invention for the purpose of identifying compounds which modulate {e.g., affect either 
positively or negatively) interactions between a marker protein and its substrates and/or 
binding partners. Such compounds can include, but are not limited to, molecules such as 
antibodies, peptides, hormones, oligonucleotides, nucleic acids, and analogs thereof. 

20 Such compounds may also be obtained from any available source, including systematic 
libraries of natural and/or synthetic compounds. The preferred assay components for use 
in this embodiment is a breast or ovarian cancer marker protein identified herein, the 
known binding partner and/or substrate of same, and the test compound. Test 
compounds can be supplied from any source. 

25 The basic principle of the assay systems used to identify compounds that 

interfere with the interaction between the marker protein and its binding partner 
involves preparing a reaction mixture containing the marker protein and its binding 
partner under conditions and for a time sufficient to allow the two products to interact 
and bind, thus forming a complex. In order to test an agent for inhibitory activity, the 

30 reaction mixture is prepared in the presence and absence of the test compound. The test 
compound can be initially included in the reaction mixture, or can be added at a time 
subsequent to the addition of the marker protein and its binding partner. Control 
reaction mixtures are incubated without the test compound or with a placebo. The 
formation of any complexes between the marker protein and its binding partner is then 

35 detected. The formation of a complex in the control reaction, but less or no such 
formation in the reaction mixture containing the test compound, indicates that the 
compound interferes with the interaction of the marker protein and its binding partner. 
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Conversely, the formation of more complex in the presence of compound than in the 
control reaction indicates that the compound may enhance interaction of the marker 
protein and its binding partner. 

The assay for compounds that interfere with the interaction of the marker 
5 protein with its binding partner may be conducted in a heterogeneous or homogeneous 
format. Heterogeneous assays involve anchoring either the marker protein or its binding 
partner onto a solid phase and detecting complexes anchored to the solid phase at the 
end of the reaction. In homogeneous assays, the entire reaction is carried out in a liquid 
phase. In either approach, the order of addition of reactants can be varied to obtain 

10 different information about the compounds being tested. For example, test compounds 
that interfere with the interaction between the marker proteins and the binding partners 
{e.g., by competition) can be identified by conducting the reaction in the presence of the 
test substance, Le., by adding the test substance to the reaction mixture prior to or 
simultaneously with the marker and its interactive binding partner. Alternatively, test 

1 5 compounds that disrupt preformed complexes, e.g., compounds with higher binding 

constants that displace one of the components from the complex, can be tested by adding 
the test compound to the reaction mixture after complexes have been formed. The 
various formats are briefly described below. 

In a heterogeneous assay system, either the marker protein or its binding 

20 partner is anchored onto a solid surface or matrix, while the other corresponding non- 
anchored component may be labeled, either directly or indirectly. In practice, microtitre 
plates are often utilized for this approach. The anchored species can be immobilized by a 
number of methods, either non-covalent or covalent, that are typically well known to one 
who practices the art. Non-covalent attachment can often be accomplished simply by 

25 coating the solid surface with a solution of the marker protein or its binding partner and 
drying. Alternatively, an inunobilized antibody specific for the assay component to be 
anchored can be used for this purpose. Such surfaces can often be prepared in advance 
and stored. 

In related embodiments, a fiision protein can be provided which adds a 
30 domain that allows one or both of the assay components to be anchored to a matrix. For 
example, glutathione-S-transferase/marker fiision proteins or glutathione-S- 
transferase/binding partner can be adsorbed onto glutathione sepharose beads (Sigma 
Chemical, St. Louis, MO) or glutathione derivatized microtiter plates, which are then 
combined with the test compound or the test compound and either the non-adsorbed 
35 marker or its binding partner, and the mixture incubated under conditions conducive to 
complex formation (e.g., physiological conditions). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound assay components, the 
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immobilized complex assessed either directly or indirectly, for example, as described 
above. Alternatively, the complexes can be dissociated from the matrix, and the level of 
marker binding or activity determined using standard techniques. 
Other techniques for inmiobilizing proteins on matrices can also be used in the screening 
5 assays of the invention. For example, either a marker protein or a marker protein 
binding partner can be immobilized utilizing conjugation of biotin and streptavidin. 
Biotinylated marker protein or target molecules can be prepared from biotin-NHS (N- 
hydroxy-succinimide) using techniques known in the art {e.g., biotinylation kit, Pierce 
Chemicals, Rockford, IL). and immobilized in the wells of streptavidin-coated 96 well 

10 plates (Pierce Chemical). In certain embodiments, the protein-immobilized surfaces can 
be prepared in advance and stored. 

In order to conduct the assay, the corresponding partner of the 
immobilized assay component is exposed to the coated surface with or without the test 
compound. After the reaction is complete, unreacted assay components are removed 

15 (e.g., by washing) and any complexes formed will remain immobilized on the solid 
surface. The detection of complexes anchored on the solid surface can be accomplished 
in a number of ways. Where the non-immobilized component is pre-labeled, the 
detection of label immobilized on the surface indicates that complexes were formed. 
Where the non-immobilized component is not pre-labeled, an indirect label can be used 

20 to detect complexes anchored on the surface; e.g., using a labeled antibody specific for 
the initially non-immobilized species (the antibody, in turn, can be directly labeled or 
indirectly labeled with, e.g., a labeled anti-Ig antibody). Depending upon the order of 
addition of reaction components, test compoxmds which modulate (inhibit or enhance) 
complex formation or which disrupt preformed complexes can be detected. 

25 In an alternate embodiment of the invention, a homogeneous assay may 

be used. This is typically a reaction, analogous to those mentioned above, which is 
conducted in a liquid phase in the presence or absence of the test compound. The formed 
complexes are then separated from unreacted components, and the amount of complex 
formed is determined. As mentioned for heterogeneous assay systems, the order of 

30 addition of reactants to the liquid phase can yield information about which test 
compounds modulate (inhibit or enhance) complex formation and which disrupt 
preformed complexes. 

In such a homogeneous assay, the reaction products may be separated 
from unreacted assay components by any of a number of standard techniques, including 

35 but not limited to: differential centrifiigation, chromatography, electrophoresis and 
immunoprecipitation. In differential centrifijgation, complexes of molecules may be 
separated from uncomplexed molecules through a series of centrifiigal steps, due to the 
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different sedimentation equilibria of complexes based on their di£ferent sizes and 
densities (see, for example, Rivas, G., and Minton, A.P., Trends Biochem Sci 1993 
Aug;18(8):284-7). Standard chromatographic techniques may also be utilized to separate 
complexed molecules from uncomplexed ones. For example, gel filtration 
5 chromatography separates molecules based on size, and through the utilization of an 
appropriate gel filtration resin in a colunm format, for example, the relatively larger 
complex may be separated from the relatively smaller uncomplexed components. 
Similarly, the relatively different charge properties of the complex as compared to the 
uncomplexed molecules may be exploited to differentially separate the complex from 

1 0 the remaining individual reactants, for example through the use of ion-exchange 

chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, 1998, J Md/. Recognit, 11 : 141-148; Hage and 
Tweed, 1997, J. Chromatogr. A Biomed ScL Appi, 699:499-525). Gel electrophoresis 
may also be employed to separate complexed molecules from unbound species (see, e.g., 

15 Ausubel et al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, New 
York. 1999). In this technique, protein or nucleic acid complexes are separated based on 
size or charge, for example. In order to maintain the binding interaction during the 
electrophoretic process, nondenaturing gels in the absence of reducing agent are 
typically preferred, but conditions appropriate to the particular interactants will be well 

20 known to one skilled in the art. Immunoprecipitation is another common technique 

utilized for the isolation of a protein-protein complex from solution (see, e.g., Ausubel et 
al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, New York. 
1999). In this technique, all proteins binding to an antibody specific to one of the 
binding molecules are precipitated from solution by conjugating the antibody to a 

25 polymer bead that may be readily collected by centrifiigation. The bound assay 

components are released from the beads (through a specific proteolysis event or other 
technique well knov^oi in the art which will not disturb the protein-protein interaction in 
the complex), and a second immunoprecipitation step is performed, this time utilizing 
antibodies specific for the correspondingly different interacting assay component. In this 

30 manner, only formed complexes should remain attached to the beads. Variations in 
complex formation in both the presence and the absence of a test compound can be 
compared, thus offering information about the ability of the compound to modulate 
interactions between the marker protein and its binding partner. 

Also within the scope of the present invention are methods for direct 

35 detection of interactions between the marker protein and its natural binding partner 
and/or a test compound in a homogeneous or heterogeneous assay system without 
further sample manipulation. For example, the technique of fluorescence energy transfer 
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may be utilized (see, e,g,, Lakowicz etal, U.S. Patent No. 5,631,169; Stavrianopoulos et 
al, U.S. Patent No. 4,868,103). Generally, this technique involves the addition of a 
fluorophore label on a first 'donor' molecule (e.g., marker or test compound) such that 
its emitted fluorescent energy will be absorbed by a fluorescent label on a second, 
5 'acceptor' molecule (e.g., marker or test compound), which in turn is able to fluoresce 
due to the absorbed energy. Alternately, the 'donor' protein molecule may simply 
utilize the natural fluorescent energy of tryptophan residues. Labels are chosen that emit 
different wavelengths of light, such that the 'acceptor' molecule label may be 
differentiated from that of the 'donor'. Since the efficiency of energy transfer between 

10 the labels is related to the distance separating the molecules, spatial relationships 

between the molecules can be assessed. In a situation in which binding occurs between 
the molecules, the fluorescent emission of the 'acceptor' molecule label in the assay 
should be maximal. An FET binding event can be conveniently measured through 
standard fluorometric detection means well known in the art (e.g., using a fluorimeter). 

15 A test substance which either enhances or hinders participation of one of the species in 
the preformed complex will result in the generation of a signal variant to that of 
background. In this way, test substances that modulate interactions between a marker 
and its binding partner can be identified in controlled assays. 

In another embodiment, modulators of marker expression are identified in 

20 a method wherein a cell is contacted with a candidate compound and the expression of 
marker mRNA or protein in the cell, is determined. The level of expression of marker 
mRNA or protein in the presence of the candidate compound is compared to the level of 
expression of marker mRNA or protein in the absence of the candidate compound. The 
candidate compound can then be identified as a modulator of marker expression based 

25 on this comparison. For example, when expression of marker mRNA or protein is 
greater (statistically significantly greater) in the presence of the candidate compound 
than in its absence, the candidate compound is identified as a stimulator of marker 
mRNA or protein expression. Conversely, when expression of marker mRNA or protein 
is less (statistically significantly less) in the presence of the candidate compound than in 

30 its absence, the candidate compound is identified as an inhibitor of marker mRNA or 
protein expression. The level of marker mRNA or protein expression in the cells can be 
determined by methods described herein for detecting marker mRNA or protein. 

In another aspect, the invention pertains to a combination of two or more 
of the assays described herein. For example, a modulating agent can be identified using 

35 a cell-based or a cell free assay, and the ability of the agent to modulate the activity of a 
marker protein can be further confirmed in v/vo, e.g., in a whole animal model for 
cellular transformation and/or timiorigenesis: 
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This invention further pertains to novel agents identified by the above- 
described screening assays. Accordingly, it is within the scope of this invention to 
further use an agent identified as described herein in an appropriate animal model. For 
example, an agent identified as described herein {e.g., an marker modulating agent, an 
5 antisense marker nucleic acid molecule, an marker-specific antibody, or an marker- 
binding partner) can be used in an animal model to determine the efficacy, toxicity, or 
side effects of treatment with such an agent. Alternatively, an agent identified as 
described herein can be used in an animal model to determine the mechanism of action 
of such an agent. Furthermore, this invention pertains to uses of novel agents identified 

10 by the above-described screening assays for treatments as described herein. 

It is understood that appropriate doses of small molecule agents and 
protein or polypeptide agents depends upon a number of factors within the knowledge of 
the ordinarily skilled physician, veterinarian, or researcher. The dose(s) of these agents 
will vary, for example, depending upon the identity, size, and condition of the subject or 

15 sample being treated, further depending upon the route by which the composition is to 
be administered, if applicable, and the effect which the practitioner desires the agent to 
have upon the nucleic acid or polypeptide of the invention. Exemplary doses of a small 
molecule include milligram or microgram amounts per kilogram of subject or sample 
weight (e.g. about 1 microgram per kilogram to about 500 milligrams per kilogram, 

20 about 100 micrograms per kilogram to about 5 milligrams per kilogram, or about 1 
microgram per kilogram to about 50 micrograms per kilogram). Exemplary doses of a 
protein or polypeptide include gram, milligram or microgram amounts per kilogram of 
subject or sample weight (e.g. about 1 microgram per kilogram to about 5 grams per 
kilogram, about 100 micrograms per kilogram to about 500 milligrams per kilogram, or 

25 about 1 milligram per kilogram to about 50 milligrams per kilogram). It is furthermore 
understood that appropriate doses of one of these agents depend upon the potency of the 
agent with respect to the expression or activity to be modulated. Such appropriate doses 
can be determined using the assays described herein. When one or more of these agents 
is to be administered to an animal (e.g. a human) in order to modulate expression or 

30 activity of a polypeptide or nucleic acid of the invention, a physician, veterinarian, or 
researcher can, for example, prescribe a relatively low dose at first, subsequently 
increasing the dose until an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend 
upon a variety of factors including the activity of the specific agent employed, the age, 

35 body weight, general health, gender, and diet of the subject, the time of administration, 
the route of administration, the rate of excretion, any drug combination, and the degree 
of expression or activity to be modulated. 
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A pharmaceutical composition of the invention is fcfrmulated to be 
compatible with its intended route of administration. Examples of routes of 
administration include parenteral, e.g., intravenous, intradermal, subcutaneous, oral 
{e.g., inhalation), transdermal (topical), transmucosal, and rectal administration. 
5 Solutions or suspensions used for parenteral, intradermal, or subcutaneous application 
can include the following components: a sterile diluent such as v/ater for injection, saline 
solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants 
such as ascorbic acid or sodium bisulfite; chelating agents such as ethylenediamine- 
10 tetraacetic acid; buffers such as acetates, citrates or phosphates and agents for the 
adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with 
acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampules, disposable syringes or multiple dose vials made 
of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile 
aqueous solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersions. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL (BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fimgi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof The proper fluidity can 
be maintained, for example, by the use of a coating such as lecithin, by the maintenance 
of the required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
for example, sugars, polyalcohols such as mannitol, sorbitol, or sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about 
by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a polypeptide or antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required. 
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followed by filtered sterilization. Generally, dispersions are prepared by incorporating 
the active compound into a sterile vehicle which contains a basic dispersion medium, 
and then incorporating the required other ingredients from those enumerated above. In 
the case of sterile powders for the preparation of sterile injectable solutions, the 
5 preferred methods of preparation are vacuum drying and freeze-drying which yields a 
powder of the active ingredient plus any additional desired ingredient from a previously 
sterile-filtered solution thereof 

Oral compositions generally include an inert diluent or an edible carrier. 
They can be enclosed in gelatin capsules or compressed into tablets. For the purpose of 

10 oral therapeutic administration, the active compound can be incorporated with excipients 
and used in the form of tablets, troches, or capsules. Oral compositions can also be 
prepared using a fluid carrier for use as a mouthwash, wherein the compound in the fluid 
carrier is applied orally and swished and expectorated or swallowed. 

Pharmaceutically compatible binding agents, and/or adjuvant materials 

15 can be included as part of the composition. The tablets, pills, capsules, troches, and the 
like can contain any of the following ingredients, or compounds of a similar nature: a 
binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as 
starch or lactose, a disintegrating agent such as alginic acid, Primogel, or com starch; a 
lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon 

20 dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form 
of an aerosol spray from a pressurized container or dispenser which contains a suitable 
propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 

25 Systemic administration can also be by transmucosal or transdermal 

means. For transmucosal or transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such penetrants are generally 
known in the art, and include, for example, for transmucosal administration, detergents, 
bile salts, and fusidic acid derivatives. Transmucosal administration can be 

30 accomplished through the use of nasal sprays or suppositories. For transdermal 
administration, the active compounds are formulated into ointments, salves, gels, or 
creams as generally known in the art. 

The compounds can also be prepared in the form of suppositories (e.g., 
with conventional suppository bases such as cocoa butter and other glycerides) or 

35 retention enemas for rectal delivery. 
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In one embodiment, the active compounds are prepared with carriers that 
will protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
5 polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 

Methods for preparation of such formulations will be apparent to those skilled in the art. 
The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes having monoclonal 
antibodies incorporated therein or thereon) can also be used as pharmaceutically 

10 acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S. Patent No. 4,522,81 1. 

It is especially advantageous to formulate oral or parenteral compositions 
in dosage unit form for ease of administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units suited as unitary dosages for the 

15 subject to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required . 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 

20 of compounding such an active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to ICQ mg/kg of body 
weight (generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage 
of 50 mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies 
and fiilly human antibodies have a longer half-life within the human body than other 

25 antibodies. Accordingly, lower dosages and less frequent administration is often 
possible. Modifications such as lipidation can be used to stabilize antibodies and to 
enhance uptake and tissue penetration. A method for lipidation of antibodies is 
described by Cruikshank et al (1997) J, Acquired Immune Deficiency Syndromes and 
Human Retrovirology 14:193. 

30 The marker nucleic acid molecules can be inserted into vectors and used 

as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (U.S. Patent 5,328,470), or by 
stereotactic injection (see, e.g., Chen et al, 1994, Proc. Natl Acad. Sci USA 91 :3054- 
3057). The pharmaceutical preparation of the gene therapy vector can include the gene 

35 therapy vector in an acceptable diluent, or can comprise a slow release matrix in which 
the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery 
vector can be produced intact from recombinant cells, e.g. retroviral vectors, the 
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pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Predictive Medicine 

The present invention pertains to the field of predictive medicine in 
which diagnostic assays, prognostic assays, pharmacogenomics, and monitoring clinical 
trails are used for prognostic (predictive) purposes to thereby treat an individual 
prophylactically. Accordingly, one aspect of the present invention relates to diagnostic 
assays for determining the level of expression of one or more marker proteins or nucleic 
acids, in order to determine whether an individual is at risk of developing breast or 
ovarian cancer. Such assays can be used for prognostic or predictive purposes to thereby 
prophylactically treat an individual prior to the onset of the cancer. 

Yet another aspect of the invention pertains to monitoring the influence 
of agents (e.g., drugs or other compounds administered either to inhibit breast or ovarian 
cancer or to treat or prevent any other disorder {/.e, in order to understand any breast or 
ovarian carcinogenic effects that such treatment may have} ) on the expression or 
activity of a marker of the invention in clinical trials. These and other agents are 
described in further detail in the following sections. 

A. Diagnostic Assays 

An exemplary method for detecting the presence or absence of a marker 
protein or nucleic acid in a biological sample involves obtaining a biological sample 
(e.g. a breast- or ovary-associated body fluid) from a test subject and contacting the 
biological sample with a compound or an agent capable of detecting the polypeptide or 
nucleic acid (e.g., mRNA, genomic DNA, or cDNA). The detection methods of the 
invention can thus be used to detect mRNA, protein, cDNA, or genomic DNA, for 
example, in a biological sample in vitro as well as in vivo. For example, in vitro 
techniques for detection of mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of a marker protein include enzyme 
linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 
immimofluorescence. In vitro techniques for detection of genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques for detection of a marker 
protein include introducing into a subject a labeled antibody directed against the protein 
or fragment thereof For example, the antibody can be labeled with a radioactive marker 
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whose presence and location in a subject can be detected by standard imaging 
techniques. 

A general principle of such diagnostic and prognostic assays involves 
preparing a sample or reaction mixture that may contain a marker, and a probe, under 
5 appropriate conditions and for a time sufficient to allow the marker and probe to interact 
and bind, thus forming a complex that can be removed and/or detected in the reaction 
mixture. These assays can be conducted in a variety of ways. 

For example, one method to conduct such an assay would involve 
anchoring the marker or probe onto a solid phase support, also referred to as a substrate, 
10 and detecting target marker/probe complexes anchored on the solid phase at the end of 
the reaction. In one embodiment of such a method, a sample from a subject, which is to 
be assayed for presence and/or concentration of marker, can be anchored onto a carrier 
or solid phase support. In another embodiment, the reverse situation is possible, in 
which the probe can be anchored to a solid phase and a sample from a subject can be 
1 5 allowed to react as an unanchored component of the assay. 

There are many established methods for anchoring assay components to 
a solid phase. These include, without limitation, marker or probe molecules which are 
immobilized through conjugation of biotin and streptavidin. Such biotinylated assay 
components can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
20 techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In 
certain embodiments, the surfaces with immobilized assay components can be prepared 
in advance and stored. 

Other suitable carriers or solid phase supports for such assays include any 
25 material capable of binding the class of molecule to which the marker or probe belongs. 
Well-known supports or carriers include, but are not limited to, glass, polystyrene, 
nylon, polypropylene, nylon, polyethylene, dextran, amylases, natural and modified 
celluloses, polyacrylamides, gabbros, and magnetite. 

In order to conduct assays with the above mentioned approaches, the non- 
30 inunobilized component is added to the solid phase upon which the second component 
is anchored. After the reaction is complete, uncomplexed components may be removed 
{e.g., by washing) imder conditions such that any complexes formed will remain 
immobilized upon the solid phase. The detection of marker/probe complexes anchored 
to the solid phase can be accomplished in a number of methods outlined herein. 



35 
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In a preferred embodiment, the probe, when it is the unanchored assay 
component, can be labeled for the purpose of detection and readout of the assay, either 
directly or indirectly, with detectable labels discussed herein and which are well-known 
to one skilled in the art. 
5 It is also possible to directly detect marker/probe complex formation 

without further manipulation or labeling of either component (marker or probe), for 
example by utilizing the technique of fluorescence energy transfer (see, for example, 
Lakowicz et aL, U.S. Patent No. 5,631,169; Stavrianopoulos, et al, U.S. Patent No. 
4,868,103). A fluorophore label on the first, 'donor' molecule is selected such that, upon 

1 0 excitation with incident light of appropriate wavelength, its emitted fluorescent energy 
will be absorbed by a fluorescent label on a second 'acceptor' molecule, which in turn is 
able to fluoresce due to the absorbed energy. Alternately, the 'donor' protein molecule 
may simply utilize the natural fluorescent energy of tryptophan residues. Labels are 
chosen that emit different wavelengths of light, such that the 'acceptor' molecule label 

1 5 may be differentiated from that of the 'donor' . Since the efficiency of energy transfer 
between the labels is related to the distance separating the molecules, spatial 
relationships between the molecules can be assessed. In a situation in which binding 
occurs between the molecules, the fluorescent emission of the 'acceptor' molecule label 
in the assay should be maximal. An FET binding event can be conveniently measured 

20 through standard fluorometric detection means well known in the art {e.g. , using a 
fluorimeter). 

In another embodiment, determination of the ability of a probe to 
recognize a marker can be accomplished without labeling either assay component (probe 
or marker) by utilizing a technology such as real-time Biomolecular Interaction Analysis 

25 (BIA) (see, eg., Sjolander, S. and Urbaniczky, C, 1991, ^ra/. Chem. 63:2338-2345 and 
Szabo et aL, 1995, Curr. Opin. Struct Biol 5:699-705). As used herein, "BIA'' or 
"surface plasmon resonance" is a technology for studying biospecific interactions in real 
time, without labeling any of the interactants (e.g., BIAcore). Changes in the mass at the 
binding surface (indicative of a binding event) result in alterations of the refractive index 

30 of light near the surface (the optical phenomenon of surface plasmon resonance (SPR)), 
resulting in a detectable signal which can be used as ah indication of real-time reactions 
between biological molecules. 

Alternatively, in another embodiment, analogous diagnostic and 
prognostic assays can be conducted with marker and probe as solutes in a liquid phase. 

35 In such an assay, the complexed marker and probe are separated from uncomplexed 
components by any of a number of standard techniques, including but not limited to: 
differential centrifiigation, chromatography, electrophoresis and immunoprecipitation. 
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In differential centrifugation, marker/probe complexes may be separated from 
uncomplexed assay components through a series of centrifugal steps, due to the different 
sedimentation equilibria of complexes based on their different sizes and densities (see, 
for example, Rivas, G., and Minton, A.P., 1993, Trends Biochem Set 18(8):284-7). 
5 Standard chromatographic techniques may also be utilized to separate complexed 
molecules from uncomplexed ones. For example, gel filtration chromatography 
separates molecules based on size, and through the utilization of an appropriate gel 
fihration resin in a column format, for example, the relatively larger complex may be 
separated from the relatively smaller uncomplexed components. Similarly, the relatively 

10 different charge properties of the marker/probe complex as compared to the 
uncomplexed components may be exploited to differentiate the complex from 
uncomplexed components, for example through the utilization of ion-exchange 
chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, N.H., 1998, J. MoL Recognit. Winter 11(1- 

15 6):141-8; Hage, D.S., and Tweed, S.A. JChromatogr B Biomed Sci Appl 1997 Oct 

10;699(l-2):499-525). Gel electrophoresis may also be employed to separate complexed 
assay components from unbound components (see, e.g., Ausubel et al, ed., Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, 1987-1999). In this 
technique, protein or nucleic acid complexes are separated based on size or charge, for 

20 example. In order to maintain the binding interaction during the electrophoretic process, 
non-denaturing gel matrix materials and conditions in the absence of reducing agent are 
typically preferred. Appropriate conditions to the particular assay and components 
thereof will be well known to one skilled in the art. 

In a particular embodiment, the level of marker mRNA can be 

25 determined both by in situ and by in vitro formats in a biological sample using methods 
known in the art. The term "biological sample" is intended to include tissues, cells, 
biological fluids and isolates thereof, isolated from a subject, as well as tissues, cells and 
fluids present within a subject. Many expression detection methods use isolated RNA. 
For in vitro methods, any RNA isolation technique that does not select against the 

30 isolation of mRNA can be utilized for the purification of RNA from breast or ovarian 
cells (see, e.g., Ausubel et al, ed., Current Protocols in Molecular Biology, John Wiley 
& Sons, New York 1 987-1 999). Additionally, large numbers of tissue samples can 
readily be processed using techniques well known to those of skill in the art, such as, for 
example, the single-step RNA isolation process of Chomczynski (1989, U.S. Patent No. 

35 4,843,155). 
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The isolated mRNA can be used in hybridization or amplification assays 
that include, but are not limited to, Southem or Northern analyses, polymerase chain 
reaction analyses and probe arrays. One preferred diagnostic method for the detection of 
mRNA levels involves contacting the isolated mRNA with a nucleic acid molecule 
5 (probe) that can hybridize to the mRNA encoded by the gene being detected. The 

nucleic acid probe can be, for example, a full-length cDNA, or a portion thereof, such as 
an oligonucleotide of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and 
sufficient to specifically hybridize under stringent conditions to a mRNA or genomic 
DNA encoding a marker of the present invention. Other suitable probes for use in the 

10 diagnostic assays of the invention are described herein. Hybridization of an mRNA with 
the probe indicates that the marker in question is being expressed. 

In one format, the mRNA is immobilized on a solid surface and contacted 
with a probe, for example by running the isolated mRNA on an agarose gel and 
transferring the mRNA from the gel to a membrane, such as nitrocellulose. In an 

15 alternative format, the probe(s) are immobilized on a solid surface and the mRNA is 
contacted with the probe(s), for example, in an Affymetrix gene chip array. A skilled 
artisan can readily adapt known mRNA detection methods for use in detecting the level 
of mRNA encoded by the markers of the present invention. 

An alternative method for determining the level of mRNA marker in a 

20 sample involves the process of nucleic acid amplification, ^.g,, by rtPCR (the 

experimental embodiment set forth in Mullis, 1987, U.S. Patent No. 4,683,202), ligase 
chain reaction (Barany, 1991, Proc. Natl Acad ScL USA, 88:189-193), self sustained 
sequence replication (Guatelli et al, 1990, Proc. Natl Acad ScL USA 87:1874-1878), 
transcriptional amplification system (Kwoh et al, 1989, Proc Natl Acad. ScL USA 

25 86:1173-1177), Q-Beta Replicase (Lizardi etal, 1988, Bio/Technology 6:1197), rolling 
circle replication (Lizardi et al, U.S. Patent No. 5,854,033) or any other nucleic acid 
amplification method, followed by the detection of the amplified molecules using 
techniques well known to those of skill in the art. These detection schemes are 
especially useful for the detection of nucleic acid molecules if such molecules are 

30 present in very low numbers. As used herein, amplification primers are defined as being 
a pair of nucleic acid molecules that can anneal to 5' or 3' regions of a gene (plus and 
minus strands, respectively, or vice- versa) and contain a short region in between. In 
general, amplification primers are from about 10 to 30 nucleotides in length and flank a 
region from about 50 to 200 nucleotides in length. Under appropriate conditions and 

35 with appropriate reagents, such primers permit the amplification of a nucleic acid 
molecule comprising the nucleotide sequence flanked by the primers. 
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For in situ methods, mRNA does not need to be isolated from the breast 
or ovarian cells prior to detection. In such methods, a cell or tissue sample is 
prepared/processed using known histological methods. The sample is then immobilized 
on a support, typically a glass slide, and then contacted with a probe that can hybridize 
5 to mRNA that encodes the marker. 

As an alternative to making determinations based on the absolute 
expression level of the marker, determinations may be based on the normalized 
expression level of the marker. Expression levels are normalized by correcting the 
absolute expression level of a marker by comparing its expression to the expression of a 

10 gene that is not a marker, e.g., a housekeeping gene that is constitutively expressed. 
Suitable genes for normalization include housekeeping genes such as the actin gene, or 
epithelial cell-specific genes. This normalization allows the comparison of the 
expression level in one sample, e.g. , a patient sample, to another sample, e.g., a non- 
breast or non-ovarian cancer sample, or between samples from different sources. 

1 5 Alternatively, the expression level can be provided as a relative 

expression level. To determine a relative expression level of a marker, the level of 
expression of the marker is determined for 10 or more samples of normal versus cancer 
cell isolates, preferably 50 or more samples, prior to the determination of the expression 
level for the sample in question. The mean expression level of each of the genes assayed 

20 in the larger number of samples is determined and this is used as a baseline expression 
level for the marker. The expression level of the marker determined for the test sample 
(absolute level of expression) is then divided by the mean expression value obtained for 
that marker. This provides a relative expression level. 

Preferably, the samples used in the baseline determination will be from 

25 breast or ovarian cancer or from non-breast or non-ovarian cancer cells of breast or 
ovarian tissue. The choice of the cell source is dependent on the use of the relative 
expression level. Using expression found in normal tissues as a mean expression score 
aids in validating whether the marker assayed is breast or ovarian specific (versus 
normal cells). In addition, as more data is accumulated, the mean expression value can 

30 be revised, providing improved relative expression values based on accumulated data. 
Expression data from breast or ovarian cells provides a means for grading the severity of 
the breast or ovarian cancer state. 

In another embodiment of the present invention, a marker protein is 
detected, A prefened agent for detecting marker protein of the invention is an antibody 

35 capable of binding to such a protein or a fragment thereof, preferably an antibody with a 
detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An 
intact antibody, or a fragment or derivative thereof (e.g., Fab or F(ab')2) can be used. 
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The term "labeled", with regard to the probe or antibody, is intended to encompass direct 
labeling of the probe or antibody by coupling physically linking) a detectable 
substance to the probe or antibody, as well as indirect labeling of the probe or antibody 
by reactivity with another reagent that is directly labeled. Examples of indirect labeling 
5 include detection of a primary antibody using a fluorescently labeled secondary antibody 
and end-labeling of a DNA probe with biotin such that it can be detected with 
fluorescently labeled streptavidin. 

Proteins from breast or ovarian cells can be isolated using techniques that 
are well known to those of skill in the art. The protein isolation methods employed can, 

10 for example, be such as those described in Harlow and Lane (Harlow and Lane, 1988, 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York). 

A variety of formats can be employed to determine whether a sample 
contains a protein that binds to a given antibody. Examples of such formats include, but 

1 5 are not limited to, enzyme immunoassay (EI A), radioimmunoassay (RIA), Western blot 
analysis and enzyme linked immunoabsorbant assay (ELISA). A skilled artisan can 
readily adapt known protein/antibody detection methods for use in determining whether 
breast or ovarian cells express a marker of the present invention. 

In one format, antibodies, or antibody fragments or derivatives, can be 

20 used in methods such as Western blots or immunofluorescence techniques to detect the 
expressed proteins. In such uses, it is generally preferable to immobilize either the 
antibody or proteins on a solid support. Suitable solid phase supports or carriers include 
any support capable of binding an antigen or an antibody. Well-known supports or 
carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, 

25 amylases, natural and modified celluloses, polyacrylamides, gabbros, and magnetite. 

One skilled in the art will know many other suitable carriers for binding 
antibody or antigen, and will be able to adapt such support for use with the present 
invention. For example, protein isolated from breast or ovarian cells can be run on a 
polyacrylamide gel electrophoresis and immobilized onto a solid phase support such as 

30 nitrocellulose. The support can then be washed with suitable buffers followed by 
treatment with the detectably labeled antibody. The solid phase support can then be 
washed with the buffer a second time to remove unbound antibody. The amount of 
bound label on the solid support can then be detected by conventional means. 

The invention also encompasses kits for detecting the presence of a 

35 marker protein or nucleic acid in a biological sample (e,g. a breast- or ovary-associated 
body fluid such as a urine sample). Such kits can be used to determine if a subject is 
suffering from or is at increased risk of developing breast or ovarian cancer. For 
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example, the kit can comprise a labeled compound or agent capable of detecting a 
marker protein or nucleic acid in a biological sample and means for determining the 
amount of the protein or mRNA in the sample (e.g., an antibody which binds the protein 
or a fragment thereof, or an oligonucleotide probe which binds to DNA or mRNA 
5 encoding the protein). Kits can also include instructions for interpreting the results 
obtained using the kit. 

For antibody-based kits, the kit can comprise, for example: (1) a first 
antibody {e,g. , attached to a solid support) which binds to a marker protein; and, 
optionally, (2) a second, different antibody which binds to either the protein or the first 

10 antibody and is conjugated to a detectable label. 

For oligonucleotide-based kits, the kit can comprise, for example: (1) an 
oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a marker protein or (2) a pair of primers useful for amplifying a 
marker nucleic acid molecule. The kit can also comprise, e.g. , a buffering agent, a 

15 preservative, or a protein stabilizing agent. The kit can further comprise components 
necessary for detecting the detectable label {e.g., an enzyme or a substrate). The kit can 
also contain a control sample or a series of control samples which can be assayed and 
compared to the test sample. Each component of the kit can be enclosed within an 
individual container and all of the various containers can be within a single package, 

20 along with instructions for interpreting the results of the assays performed using the kit. 

B. Pharmacogenomics 

The marker of the invention are also useful as pharmacogenomic 
markers. As used herein, a "pharmacogenomic marker" is an objective biochemical 

25 marker whose expression level correlates with a specific clinical drug response or 

susceptibility in a patient (see, e.g., McLeod et al (1999) Euk J. Cancer 35(12): 1650- 
1652). The presence or quantity of the pharmacogenomic marker expression is related 
to the predicted responsive of the patient and more particularly the patient's tumor to 
therapy with a specific drug or class of drugs. By assessing the presence or quantity of 

30 the expression of one or more pharmacogenomic markers in a patient, a drug therapy 
which is most appropriate for the patient, or which is predicted to have a greater degree 
of success, may be selected. For example, based on the presence or quantity of RNA or 
protein encoded by specific tumor markers in a patient, a drug or course of treatment 
may be selected that is optimized for the treatment of the specific tumor likely to be 

35 present in the patient. The use of pharmacogenomic markers therefore permits selecting 
or designing the most appropriate treatment for each cancer patient without trying 
different drugs or regimes. 
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Another aspect of pharmacogenomics deals with genetic conditions that 
alters the way the body acts on drugs. These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the 
5 main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofiirans) and consumption of fava beans. 

. As an illustrative embodiment, the activity of drug metabolizing enzymes 
is a major determinant of both the intensity and duration of drug action. The discovery 
of genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 

10 (NAT 2) and cytochrome P450 enzymes C YP2D6 and C YP2C 1 9) has provided an 
explanation as to why some patients do not obtain the expected drug effects or show 
exaggerated drug response and serious toxicity after taking the standard and safe dose of 
a drug. These polymorphisms are expressed in two phenotypes in the population, the 
extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is 

15 different among different populations. For example, the gene coding for CYP2D6 is 
highly polymorphic and several mutations have been identified in PM, which all lead to 
the absence of fimctional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C 19 quite 
frequently experience exaggerated drug response and side effects when they receive 
standard doses. If a metabolite is the active therapeutic moiety, a PM will show no 

20 therapeutic response, as demonstrated for the analgesic effect of codeine mediated by its 
CYP2D6-formed metabolite morphine. The other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 
ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 

Thus, the level of expression of a marker of the invention in an individual 

25 can be determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. In addition, pharmacogenetic studies can be used to apply 
genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the 
identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure 

30 and thus enhance therapeutic or prophylactic efficiency when treating a subject with a 
modulator of expression of a marker of the invention. 

C. Monitoring Clinical Trials 

Monitoring the influence of agents {e.g., drug compounds) on the level of 
35 expression of a marker of the invention can be applied not only in basic drug screening, 
but also in clinical trials. For example, the effectiveness of an agent to affect marker 
expression can be monitored in clinical trials of subjects receiving treatment for breast or 
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ovarian cancer. In a preferred embodiment, the present invention provides a method for 
monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate) comprising the steps of (i) obtaining a pre-administration sample from a 
5 subject prior to administration of the agent; (ii) detecting the level of expression of one 
or more selected markers of the invention in the pre-administration sample; (iii) 
obtaining one or more post-administration samples from the subject; (iv) detecting the 
level of expression of the marker(s) in the post-administration samples; (v) comparing 
the level of expression of the marker(s) in the pre-administration sample with the level 

10 of expression of the marker(s) in the post-administration sample or samples; and (vi) 
altering the administration of the agent to the subject accordingly. For example, 
increased ofexpression of the marker gene(s) during the course of treatment may 
indicate ineffective dosage and the desirability of increasing the dosage. Conversely, 
decreased expression of the marker gene(s) may indicate efficacious treatment and no 

1 5 need to change dosage. 

D. Electronic Apparatus Readable Media and Arrays 
Electronic apparatus readable media comprising a marker of the present 
invention is also provided. As used herein, "electronic apparatus readable media" refers 

20 to any suitable medium for storing, holding or containing data or information that can be 
read and accessed directly by an electronic apparatus. Such media can include, but are 
not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, 
and magnetic tape; optical storage media such as compact disc; electronic storage media 
such as RAM, ROM, EPROM, EEPROM and the like; general hard disks and hybrids of 

25 these categories such as magnetic/optical storage media. The medium is adapted or 
configured for having recorded thereon a marker of the present invention. 

As used herein, the term "electronic apparatus" is intended to include any 
suitable computing or processing apparatus or other device configured or adapted for 
storing data or information. Examples of electronic apparatus suitable for use v^th the 

30 present invention include stand-alone computing apparatus; networks, including a local 
area network (LAN), a wide area network (WAN) Internet, Intranet, and Extranet; 
electronic appliances such as a personal digital assistants (PDAs), cellular phone, pager 
and the like; and local and distributed processing systems. 

As used herein, "recorded" refers to a process for storing or encoding 

35 information on the electronic apparatus readable medium. Those skilled in the art can 
readily adopt any of the presently known methods for recording information on known 
media to generate manufactures comprising the markers of the present invention. 
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A variety of software programs and formats can be used to store the 
marker information of the present invention on the electronic apparatus readable 
medium. For example, the marker nucleic acid sequence can be represented in a word 
processing text file, formatted in commercially-available software such as WordPerfect 
5 and Microsoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as DB2, Sybase, Oracle, or the like, as well as in other forms. Any 
number of data processor structuring formats (e.g., text file or database) may be 
employed in order to obtain or create a medium having recorded thereon the markers of 
the present invention. 

10 By providing the markers of the invention in readable form, one can 

routinely access the marker sequence information for a variety of purposes. For 
example, one skilled in the art can use the nucleotide or amino acid sequences of the 
present invention in readable form to compare a target sequence or target structural 
motif with the sequence information stored within the data storage means. Search 

15 means are used to identify fragments or regions of the sequences of the invention which 
match a particular target sequence or target motif 

The present invention therefore provides a medium for holding 
instructions for performing a method for determining whether a subject has breast or 
ovarian cancer or a pre-disposition to breast or ovarian cancer, wherein the method 

20 comprises the steps of determining the presence or absence of a marker and based on the 
presence or absence of the marker, determining whether the subject has breast or ovarian 
cancer or a pre-disposition to breast or ovarian cancer and/or recommending a particular 
treatment for breast or ovarian cancer or pre-breast or pre-ovarian cancer condition. 

The present invention further provides in an electronic system and/or in a 

25 network, a method for determining whether a subject has breast or ovarian cancer or a 
pre-disposition to breast or ovarian cancer associated with a marker wherein the method 
comprises the steps of determining the presence or absence of the marker, and based on 
the presence or absence of the marker, determining whether the subject has breast or 
ovarian cancer or a pre-disposition to breast or ovarian cancer, and/or recommending a 

30 particular treatment for the breast or ovarian cancer or pre-breast or pre-ovarian cancer 
condition. The method may fiirther comprise the step of receiving phenotypic 
information associated with the subject and/or acquiring from a network phenotypic 
information associated with the subject. 

The present invention also provides in a network, a method for 

35 determining whether a subject has breast or ovarian cancer or a pre-disposition to breast 
or ovarian cancer associated with a marker, said method comprising the steps of 
receiving information associated with the marker receiving phenotypic information 
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associated with the subject, acquiring information from the network corresponding to the 
marker and/or breast or pre-ovarian cancer, and based on one or more of the phenotypic 
information, the marker, and the acquired information, determining whether the subject 
has a breast or ovarian cancer or a pre-disposition to breast or ovarian cancer. The 
5 method may further comprise the step of recommending a particular treatment for the 
breast or ovarian cancer or pre-breast or pre-ovarian cancer condition. 

The present invention also provides a business method for determining 
. whether a subject has breast or ovarian cancer or a pre-disposition to breast or ovarian 
cancer, said method comprising the steps of receiving information associated with the 

10 marker, receiving phenotypic information associated with the subject, acquiring 
information from the network corresponding to the marker and/or breast or ovarian 
cancer, and based on one or more of the phenotypic information, the marker, and the 
acquired information, determining whether the subject has breast or ovarian cancer or a 
pre-disposition to breast or ovarian cancer. The method may further comprise the step of 

1 5 recommending a particular treatment for the breast or ovarian cancer or pre-breast or 
pre-ovarian cancer condition. 

The invention also includes an array comprising a marker of the present 
invention. The array can be used to assay expression of one or more genes in the array. 
In one embodiment, the array can be used to assay gene expression in a tissue to 

20 ascertain tissue specificity of genes in the array. In this manner, up to about 7600 genes 
can be simultaneously assayed for expression. This allows a profile to be developed 
showing a battery of genes specifically expressed in one or more tissues. 

In addition to such qualitative determination, the invention allows the 
quantitation of gene expression. Thus, not only tissue specificity, but also the level of 

25 expression of a battery of genes in the tissue is ascertainable. Thus, genes can be 
grouped on the basis of their tissue expression per se and level of expression in that 
tissue. This is usefiil, for example, in ascertaining the relationship of gene expression 
between or among tissues. Thus, one tissue can be perturbed and the effect on gene 
expression in a second tissue can be determined. In this context, the effect of one cell 

30 type on another cell type in response to a biological stimulus can be determined. Such a 
determination is useful, for example, to know the effect of cell-cell interaction at the 
level of gene expression. If an agent is administered therapeutically to treat one cell 
type but has an undesirable effect on another cell type, the invention provides an assay 
to determine the molecular basis of the undesirable effect and thus provides the 

35 opportunity to co-administer a counteracting agent or otherwise treat the undesired 
effect. Similarly, even within a single cell type, undesirable biological effects can be 
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determined at the molecular level. Thus, the effects of an agent on expression of other 
than the target gene can be ascertained and counteracted. 

In another embodiment, the array can be used to monitor the time course 
of expression of one or more genes in the array. This can occur in various biological 
5 contexts, as disclosed herein, for example development of breast or ovarian cancer, 
progression of breast or ovarian cancer, and processes, such a cellular transformation 
associated with breast or ovarian cancer. 

The array is also useful for ascertaining the effect of the expression of a 
gene on the expression of other genes in the same cell or in different cells. This 
10 provides, for example, for a selection of alternate molecular targets for therapeutic 
intervention if the ultimate or downstream target cannot be regulated. 

The array is also useful for ascertaining differential expression patterns of 
one or more genes in normal and abnormal cells. This provides a battery of genes that 
could serve as a molecular target for diagnosis or therapeutic intervention. 

15 

E. Surrogate Markers 

The markers of the invention may serve as surrogate markers for one or 
more disorders or disease states or for conditions leading up to disease states, and in 
particular, breast or ovarian cancer. As used herein, a "surrogate marker" is an objective 

20 biochemical marker which correlates with the absence or presence of a disease or 
disorder, or with the progression of a disease or disorder (e.g., with the presence or 
absence of a tumor). The presence or quantity of such markers is independent of the 
disease. Therefore, these markers may serve to indicate whether a particular course of 
treatment is effective in lessening a disease state or disorder. Surrogate markers are of 

25 particular use when the presence or extent of a disease state or disorder is difficult to 
assess through standard methodologies (e,g., early stage tumors), or when an assessment 
of disease progression is desired before a potentially dangerous clinical endpoint is 
reached (e.g. , an assessment of cardiovascular disease may be made using cholesterol 
levels as a surrogate marker, and an analysis of HIV infection may be made using HIV 

30 RNA levels as a surrogate marker, well in advance of the undesirable clinical outcomes 
of myocardial infarction or fully-developed AIDS). Examples of the use of surrogate 
markers in the art include: Koomen et qL (2000) J. Mass, Spectrom, 35: 258-264; and 
James (1994) AIDS Treatment News Archive 209. 

The markers of the invention are also useful as pharmacodynamic 

35 markers. As used herein, a "pharmacodynamic marker" is an objective biochemical 
marker which correlates specifically with drug effects. The presence or quantity of a 
pharmacodynamic marker is not related to the disease state or disorder for which the 
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drug is being administered; therefore, the presence or quantity of the marker is indicative 
of the presence or activity of the drug in a subject. For example, a pharmacodynamic 
marker may be indicative of the concentration of the drug in a biological tissue, in that 
the marker is either expressed or transcribed or not expressed or transcribed in that tissue 
5 in relationship to the level of the drug. In this fashion, the distribution or uptake of the 
drug may be monitored by the pharmacodynamic marker. Similarly, the presence or 
quantity of the pharmacodynamic marker may be related to the presence or quantity of 
the metabolic product of a drug, such that the presence or quantity of the marker is 
indicative of the relative breakdown rate of the drug in vivo. Pharmacodynamic markers 

10 are of particular use in increasing the sensitivity of detection of drug effects, particularly 
when the drug is administered in low doses. Since even a small amount of a drug may 
be sufficient to activate multiple rounds of marker transcription or expression, the 
amplified marker may be in a quantity which is more readily detectable than the drug 
itself. Also, the marker may be more easily detected due to the nature of the marker 

1 5 itself; for example, using the methods described herein, antibodies may be employed in 
an immune-based detection system for a protein marker, or marker-specific radiolabeled 
probes may be used to detect a mRNA marker. Furthermore, the use of a 
pharmacodynamic marker may offer mechanism-based prediction of risk due to drug 
treatment beyond the range of possible direct observations. Examples of the use of 

20 pharmacodynamic markers in the art include: Matsuda et al. US 6,033,862; Hattis et al 
(1991) Env. Health PerspecL 90: 229-238; Schentag {\999) Am, 1 Healih-Syst. Pharm. 
56 Suppl. 3: S21-S24; and Nicolau (1999) Am, J. Health-Syst Pharm. 56 Suppl. 3: S16- 
S20. 

25 Experimental Protocol 

A. Identification of Markers And Assembly of their Sequences 
RNA from tumor and normal breast and ovarian tissue samples were 
extracted and amplified by poly-dT primed RT-PCR into cDNA using the SMART PGR 
kit from Clonetech. Amplified cDNA was then labeled using random priming PRIME- 

30 IT from Stratagene with a radioactive nucleotide. Labeled cDNA was hybridized to 

nylon filters spotted with purified PGR product from EST sequences representing known 
and unknown genes. Several thousand clones were spotted on each nylon filter. 
Duplicate independent hybridization experiments were performed to generate 
transcriptional profiling data (see Nature Genetics, 1999, 21). After repeated washings 

35 the nylon filters were scanned and the intensity of each spotted gene was converted 
electronically to indicate expression level in the sample fi^om which the cDNA was 
derived. Tables were generated for each sample showing the expression level for each 
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of the spotted ESTs. These tables were transferred to Microsoft Excel spreadsheets and 
the expression levels for each spotted EST was compared between samples. A total of 
41 tumor samples representing both early and late stage breast cancer and 7 normal 
breast tissue samples were profiled on these EST arrays. Additionally, a total of 70 late 
S stage ovarian tumor samples and S normal ovarian tissue samples were also profiled on 
the EST arrays. ESTs that displayed a 5-fold increase in the expression level over the 
average expression level in the normal samples in at least 30% of the tumor samples 
were exported to a separate data table. 

The corresponding nucleotide sequences for each of these spots were 

10 imported and blasted against both public and proprietary sequence databases in order to 
identify other EST sequences with significant overlap. Thus, contiguous EST sequences 
were assembled into tentative full-length genes. Reblasting of the assembled sequences 
against databases of genes coding for known proteins was done to assess whether the 
assembled gene was a known or unknown protein. Genes in which the potential open 

15 reading frame was still open in the 5' end were experimentally extended by either 

5 'RACE PCR or extracted out from full length cDNA libraries by a simple PCR reaction 
between the vector and 5'end of the assembled electronic sequence. To predict whether 
an assembled gene encodes a potential integral membrane protein, hydropathy 
predictions of the predicted open reading frame was performed (Jones et al, 1 994, 

20 Biochemistry. 33:3038-3049). If the open reading frame contained a predicted signal 
peptide in the N-terminal portion and a single membrane spanning domain, it was 
labeled as being a potential type I transmembrane protein. If the predicted amino acid 
sequence contained a transmembrane domain in the N-terminal portion of the protein, it 
was labeled as being a potential type II transmembrane protein. If the predicted amino 

25 acid sequence was a short hydrophobic protein (<50 amino acids) it was labeled as a 
potential integral membrane protein. If the predicted amino acid sequence contained 
multiple membrane spanning regions it was labeled as a multi-transmembrane (multi- 
TM) region protein 

30 B. Identification of Marker 7 and Marker 23 as Targets for Anti-cancer 

Therapy 

Expression levels of Marker 7, a putative transmembrane protein was >5- 
fold higher in 25/56 breast, 1 7/20 colon and 26/58 ovarian cancer samples compared to 
normal tissues. The full-length gene was cloned and expressed and the protein found to 
35 be localized to the cell surface of transfected cells. Marker 7 does not belong to any 
known protein family and does not show significant homology to any protein in the 
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public databases. Northern blots of various carcinoma cells lines reveal the presence of 
a single mRNA species at approximately 1 .4 kb. 

Expression of Marker 7 in normal and malignant human tissues was 
further evaluated by quantitative PGR analysis. Expression levels in breast, ovary, lung 
S and colon tumor samples were 10-300 fold higher than corresponding normal tissues. In 
addition there was high expression of Marker 7 in in vitro cultured endothelial cells and 
Wilms tumors and hemangiomas, which are highly vascularized tumors. In situ 
hybridization (ISH) on tumor samples showed that Marker 7 is predominantly expressed 
within the tumor stroma and possibly localized to tumor vasciilature. Analysis of 

1 0 normal human tissues, including aorta, by ISH suggested that Marker 7 is not expressed 
on cells within mature vessels. When human tumor cells are transplanted 
subcutaneously in immunodeficient mice, there is an induction of Marker 7 expression 
in the mouse stroma associated with tumor vasculature. Marker 7 is hence found 
expressed in many human cancers, (e.g. breast, ovary, colon, lung and prostate) and not 

1 5 in normal aduh tissue. 

A similar analysis of Marker 23 showed that this marker is stroma 
specific, and is upregulated in ovary, breast, lung and colon cancers. Marker 7 and 
Marker 23 are therefore attractive targets for inhibition of cancers as well as 
angiogenesis in general. Antibodies, antibody derivatives, and antibody fragments 

20 which bind, specifically with Marker 7 or Marker 32 protein (i.e., SEQ ID NOs: 14 and 
64, respectively), or a fragment of the protein, may be used to treat cancer of the breast, 
ovary, lung, colon and prostate as well as generally inhibiting angiogenesis. 

VII. Simimary of the Data in the Tables: 
25 Table 1 lists all of the markers of the invention. 

Table 2 lists Markers 1-33 which were found to be upregulated (i.e., over- 
expressed) by transcription profiling (TP) in breast cancer. The markers were 
upregulated at least 5-fold in >30% of the tumors arrayed. 

Table 3 lists Markers 34-56 which were found to be upregulated by TP in 
30 ovarian cancer. The markers were upregulated at least 5-fold in >30% of the tumors 
arrayed. 

Table 4 lists markers in which additional expression analyses were done 
by either in situ hybridization (ISH), quantitative mRNA analysis (Taqman) or both. 
Table 5 lists markers whose encoded protein were heretofore unknown. 
35 In Tables 1-3 and 5 the following definitions apply: 

"Marker" corresponds to the arbitrary identifier used within this 
application to designate the marker of the invention. 
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"Gene Name" corresponds to the commonly used terminology for the 
marker gene, if it exists. 

"Image Clone ID" corresponds to the cDNA clone number from the 
IMAGE Consortium (see, for example Lennon, G., et al, 1996, Genomics 33:151-152; 
5 and http://www-bioJhil.gov/bbrp/image/image.html) . All referenced IMAGE clone 
sequences are expressly incorporated herein by reference. 

"SEQ ID NO (nts)" designates the entry number in the Sequence Listing 
that corresponds to the nucleotide sequence of the particular marker. "SEQ ID NO 
(AAs)" designates the entry number in the Sequence Listing that corresponds to the 

10 amino acid sequence of the particular marker. Each known sequence submitted to 
GenBank has a unique identifier number, also called the GenBank GI Accession 
Number, for a complete sequence record in the relevant database (see, e.g. 
"http://www.ncbi.nlm.nih.gov/genbank/ query form-html" and " www. derwent .com'' for 
further information). "Acc # (NTS)" corresponds to the GenBank Accession Number for 

15 a nucleotide sequence, while "Acc # (AA)" corresponds to the GenBank Accession 
Number for a protein sequence. "GI # (NTS)" is the GI identification number assigned 
to the nucleotide sequence of the marker gene in the GenBank database (see supra). "GI 
# (AA)" corresponds to the GI sequence identification number assigned to that particular 
protein translation within a nucleotide sequence record in the GenBank database. 

20 The following data is presented in Table 4: 

"Gene" corresponds to the arbitrary identifier used within this application 
to designate the marker of the invention. 

The "TaqMan" and "ISH" columns of Table 4, designate whether 
expression of this marker was analyzed using TaqMan technology or in situ 

25 hybridization, respectively. "Yes" indicates that such analysis was done, while "No" 
similarly indicates that such analysis was not done. "TaqMan" corresponds to the results 
of quantitative PGR analysis using the TaqMan technology. Briefly, TaqMan technology 
relies on standard RT-PCR with the addition of a third gene-specific oligonucleotide 
(referred to as a probe) which has a fluorescent dye coupled to its 5' end (typically 

30 6-FAM) and a quenching dye at the 3' end (typically TAMRA). When the fluorescently 
tagged oligonucleotide is intact, the fluorescent signal from the 5* dye is quenched. As 
PGR proceeds, the 5' to 3* nucleolytic activity of taq polymerase digests the labeled 
primer, producing a free nucleotide labeled with 6-FAM, which is detected as a 
fluorescent signal. The PGR cycle where fluorescence is first released and detected is 

35 directly proportional to the starting amount of the gene of interest in the test sample, thus 
providing a way of quantitating the initial template concentration. 
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"Ovary", "Breast", "Lung", "Colon", and "Prostate" correspond to 
expression as detected by TaqMan analysis in ovarian, breast, lung, colon and prostate 
cancer respectively. Markers scored with a "+" were found to be upregulated by at least 
3-fold in at least 20% of the tumors analyzed (n=>5) in the designated tumor type by 
5 Taqman analysis. Markers scored with a "-" were not found to be upregulated in the 
designated tumor type by Taqman analysis. Expression for markers scored with "ND" 
was not determined in the designated tumor type. In addition, ISH analysis confirmed 
that the genes were expressed by the carcinoma cells, except for Marker 23, which is 
stroma specific and Marker 7 which is expressed mostly in the stroma but can also be 
10 found on tumor cells. Evidence to support this includes Taqman RNA analysis fi-om 
cancer cell lines (breast, ovary, lung, colon and prostate) and ISH. 

The contents of all references, patents, published patent applications, and 
database records including GenBank, IMAGE consortium and Derwent cited throughout 
this application, are hereby incorporated by reference. 

15 
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Other Embodiments 

Those skilled in the art will recognize, or be able to ascertain using no 
more than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by the 
5 following claims: 
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What is claimed : 

1 . A method of assessing whether a patient is afflicted with breast cancer, the 
method comprising comparing: 

5 a) the level of expression of a marker in a patient sample, wherein the 

marker comprises SEQ ID NO: 1 , and 

b) the normal level of expression of the marker in a control non- 
cancerous breast sample, 

wherein a significant increase in the level of expression of the marker in the 
10 patient sample and the normal level is an indication that the patient is afflicted with 
breast cancer. 

2. A method of assessing whether a patient is afflicted with ovarian cancer, the 
method comprising comparing: 

1 5 a) the level of expression of a marker in a patient sample, wherein the 

marker comprises SEQ ID NO:67, and 

b) the normal level of expression of the marker in a control non- 
cancerous ovarian sample, 

wherein a significant increase in the level of expression of the marker in the 
20 patient sample and the normal level is an indication that the patient is afflicted with 
ovarian cancer. 

3. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of SEQ ID NOs: 9, 13, 19, 29, 35, 37, 55 and 89. 

25 

4. A vector which contains the nucleic acid molecule of claim 3. 

5. A host cell which contains the nucleic acid molecule of claim 3. 

30 6. An isolated polypeptide which is encoded by a nucleic acid molecule comprising 
a nucleotide sequence selected from the group consisting of SEQ ID NOs: 9, 13, 19, 29, 
35, 37, 55 and 89. 

7. An antibody which selectively binds to the polypeptide of claim 6. 



35 
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8. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOs: 10, 14, 20, 30, 36, 38, 56 and 90. 

9. An antibody which selectively binds to the polypeptide of claim 8. 
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SEQUENCE LISTING 



<110> Millennium Pharmaceuticals/ Inc. et al. 

<120> COMPOSITIONS, KITS, AND METHODS FOR 

IDENTIFICATION, ASSESSMENT, PREVENTION, AND THERAPY OF BREAST 
AND OVARIAN CANCER 



<130> MRI-039PC 



<150> 60/300,159 
<151> 2001-06-21 

<150> 60/301,351 
<151> 2001-03-27 



<160> 112 



<170> FastSEQ for Windows Version 4.0 



<210> 1 

<211> 4275 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 4241, 4243, 4244, 4246, 4247, 4250, 4253, 4254, 4255, 4259, 
4260, 4261, 4262, 4263, 4266, 4270, 4271, 4272, 4273 
<223> n = A,T,C or G 

<400> 1 

ccgggccagg cgcggagctg gcggcagtga caggaggcgc gaacccgcag cgcttaccgc 60 
gcggcgccgc accatggagc ccgccgtgtc gctggccgtg tgcgcgctgc tcttcctgct 120 
gtgggtgcgc ctgaaggggc tggagttcgt gctcatccac cagcgctggg tgttcgtgtg 180 
cctcttcctc ctgccgctct cgcttatctt cgatatctac tactacgtgc gcgcctgggt 240 
ggtgttcaag ctcagcagcg ctccgcgcct gcacgagcag cgcgtgcggg acatccagaa 300 
gcaggtgcgg gaatggaagg agcagggtag caagaccttc atgtgcacgg ggcgccctgg 360 
ctggctcact gtctcactac gtgtcgggaa gtacaagaag acacacaaaa acatcatgat 420 
caacctgatg gacattctgg aagtggacac caagaaacag attgtccgtg tggagccctt 480 
ggtgaccatg ggccaggtga ctgccctgct gacctccatt ggctggactc tccccgtgtt 54 0 
gcctgagctt gatgacctca cagtgggggg cttgatcatg ggcacaggca tcgagtcatc 600 
atcccacaag tacggcctgt tccaacacat ctgcactgct tacgagctgg tcctggctga 660 
tggcagcttt gtgcgatgca ctccgtccga aaactcagac ctgttctatg ccgtaccctg 720 
gtcctgtggg acgctgggtt tcctggtggc cgctgagatc cgcatcatcc ctgccaagaa 780 
gtacgtcaag ctgcgtttcg agccagtgcg gggcctggag gctatctgtg ccaagttcac 84 0 
ccacgagtcc cagcggcagg agaaccactt cgtggaaggg ctgctctact ccctggatga 900 
ggctgtcatt atgacagggg tcatgacaga tgaggcagag cccagcaagc tgaatagcat 960 
tggcaattac tacaagccgt ggttctttaa gcatgtggag aactatctga agacaaaccg 1020 
agagggcctg gagtacattc ccttgagaca ctactaccac cgccacacgc gcagcatctt 1080 
ttgggagctc caggacatca tcccctttgg caacaacccc atcttccgct acctctttgg 114 0 
ctggatggtg cctcccaaga tctccctcct gaagctgacc cagggtgaga ccctgcgcaa 1200 
gctgtacgag cagcaccacg tggtgcagga catgctggtg cccatgaagt gcctgcagca 1260 
ggccctgcac accttccaaa acgacatcca cgtctacccc atctggctgt gtccgttcat 1320 
cctgcccagc cagccaggcc tagtgcaccc caaaggaaat gaggcagagc tctacatcga 1380 
cattggagca tatggggagc cgcgtgtgaa acactttgaa gccaggtcct gcatgaggca 1440 
gctggagaag tttgtccgca gcgtgcatgg cttccagatg ctgtatgccg actgctacat 1500 
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gaaccgggag gagttctggg agatgtttga tggctccttg taccacaagc tgcgagagaa 1560 
gctgggttgc caggacgcct tccccgaggt gtacgacaag atctgcaagg ccgccaggca 1620 
ctgagctgga gcccgcctgg agagacagac acgtgtgagt ggtcaggcat cttcccttca 1680 
ctcaagcttg gctgctttcc tagatccaca ctttcaaaga gaaacccctc cagaactccc 1740 
accctgacag cccaacacca ccttcctcct ggcttccagg gggcagccca gtggaatgga 1800 
aagaatgtgg gatttggagt cagacaagcc tgagtccagt tccccgttta gaactcatta 1860 
gctgtgtgac tctgggtgag tcccttaacc cctctgagcc cgggtctctt cattagttga 1920 
aagggatagt aatacctact tgcaggttgt tgtcatctga gttgagcact ggtcacattg 1980 
aaggtgctgg gtaagtggta gctcttgttg cttcccgttc agcgtcacat ctgcagtgga 2040 
gcctgaaaag gctccacatt aggtcacctg tgcacagcca tggctggaat gatgaagggg 2100 
atacgctgga gttgccctgc catcgcctcc atcagccaga cgaggtcctc acaggagaag 2160 
gacagctctt ccccaccctg ggatctcagg agggcagcca cggagtgggg aggccccaga 2220 
tgcgctgtgc caaagccagg tccgaggcca aagttctccc tgccatcctt ggtgccgtcc 2280 
tgccccttcc tccttcatgc ctgggcctgc aggcccaccc cagccaccac tgagtccact 2340 
cggagtgccc tgtgttcctg gagaaggcat tccagggttg aatcttgtcc cagcctcagc 2400 
ctgggacacc taggtggaga gagtggtctc cgctctgaat tggatccagg ggacctgggc 24 60 
tcattcttct tggctcacca accctgcagg cctcatcttt cccaaaaccc actttgtctt 2520 
ggtgggagtg ggtccgcgct gctctgcagc aggggctggg gagtggacag catcaggtgg 2580 
gaaagtggag tccaccctca tgtttctgta ggattctcac cgtggggctg gaagaaaaga 2640 
gcatcgactt gatttctcca accactcatc cctctttttc tttcttccac cactccccac 2700 
cccagctgta gttaatttca gtgccttaca aatcctaagc tcagagaaag ttccatttcc 2760 
gttccagagg gaagggaacc tccctaggtc cttccctggc ttgttataac gcaaagcttg 2820 
gttgtttatg caactctatc ttaagaactg cccagcctca gctgaaaacc cgaatctgag 2880 
aaggaattgc gtcatgtaag ggaagctgga attaagggag ctgagccagt catggttgtg 2940 
gcgtgtgagt caggagacct aggtttcagc ccctctctac tgtcagcgag ctgtgcaacg 3000 
tgggcaagtc attgtcctct gagctgcagt ttcctcatct gtcacatcgc tacagacaag 3060 
acctccctgg aacccttctg attgtcttag acactgtggt tgcaaaaccc acggaaagcc 3120 
tcatttgtgt ggaaagtcag aggaaaaatg atccagtgga cacttgggga ttatctgtca 3180 
ttcaagatcc ttccttcaac cccaaggtca gctcccatct catttccaga aaggctcata 3240 
cctggcttgc agggaagcat ctgtcttgtc attccaggtg ccagaatcct ctcagagtca 3300 
ttgaagggtg ttcacccatc ccacccaagg cttggcacac tgccagtgtc ttagcagggt 3360 
cttgtgaggg ctgggggcat ccaggcactc agaaggcaaa ggaaccaccc tacccatttg 3420 
gcctctggag ggggcagaag aaagaaagaa acctcatcct atattttaca aagcatgtga 3480 
attctggcat tagctctcat aggagaccca tgtgcttcct tgctcagtgc aaaactgatg 3540 
attctacttg ctgtagatga atggttaaca cgagctagtt aaacagtgcc attgttttgc 3600 
cagtgaagcc tccaacccta agccactggg acggtggcca gagatgccag cagcctctgt 3660 
cgcccttagt catataacca aaatccagac cttatccaca acccggggct tggaaaggaa 3720 
ggtattttgg aatcacaccc tccggttatg ttgctccagt aaaatcttgc ctggaaagag 3780 
gcagtcttct tagcatggtg agctgagttc atggcttttt tttgtagcca gtcctgtccc 3840 
tggccatcca tgtgatggtt ttggatggag ttaaacttga tgccagtggg cagtgcatgt 3900 
ggaaagtatc agagtaagcc tctcccctcc agagccctga gtttcttggc tgcatgaagg 3960 
ttttctttag aatcagaatt gtagccagtt tctttggcca gaaggatgaa tacttggata 4020 
ttactgaaag ggaggggtgg agatgggtgt ggcagtgtat ggtgtgtgat ttttattttc 4080 
ttctttggtc atgggggcca aggagaaagg catgaatctt ccctgtcagg ctcttacagc 4140 
cacaggcact gtgtctactg tctggaagac atgtccccgt ggctgtgggg ccgctgcttc 4200 
tgtttaaata aaagtggcct ggaaaaaaaa aaaaaaaaaa ngnnannstn yknnnctknn 4260 
nnngtnhgsn nnnts 4275 

<210> 2 
<211> 516 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Glu Pro Ala Val Ser Leu Ala Val Cys Ala Leu Leu Phe Leu Leu 

15 10 15 

Trp Val Arg Leu Lys Gly Leu Glu Phe Val Leu He His Gin Arg Trp 

20 25 30 

Val Phe Val Cys Leu Phe Leu Leu Pro Leu Ser Leu He Phe Asp He 
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35 40 



Tyr 


Tyr 


Tyr 


Val 


Arg Ala 


Trp 


Val 












55 




Arg 


Leu 


His 


Glu Gin Arg 


Val 


Arg 


65 








70 






Trp 


Lys 


Glu 


Gin Gly Ser 


Lys 


rp W w> 

Tfir 










85 






Trp 


Leu 


ni w w 

Tnr 


Val 


Ser Leu 


Arg 


Val 








100 








Asn 


lie 


Met 


He 


Asn Leu 


Met 


Asp 






lib 










Gin 


T 1 A 

lie 


Val 


Arg 


Val Glu 


Pro 


Leu 












i Jb 




Leu 


Leu 


Thr 


Ser 


He Gly 


Trp 


Thr 


14 b 








150 






Asp 


Leu 


Thr 


Val 


Gly Gly 


Leu 


He 










165 






Ser 


His 


Lys 


Tyr Gly Leu 


Phe 


Gin 








180 








Val 


Leu 


Ala 


Asp Gly Ser 


Phe 


Val 






195 








200 


Asp 


Leu 


Phe 


Tyr 


Ala Val 


Pro 


Trp 




210 








215 




Val 


TV 1 

Ala 


Ala 


Glu 


He Arg 


He 


He 


225 








230 






Arg 


Phe 


Glu 


Pro 


Val Arg 


Gly Leu 










245 






His 


Glu 


Ser 


Gin Arg Gin 


Glu 


Asn 








260 








Ser 


Leu 


Asp 


Glu 


Ala Val 


He 


Met 






275 








280 


Glu 


Pro 


Ser 


Lys 


Leu Asn 


Ser 


He 




o n rk 

290 








295 




Phe 


Lys 


His 


Val 


Glu Asn 


Tyr 


Leu 


305 








310 






Tyr 


lie 


Pro 


Leu 


Arg His 


Tyr 


Tyr 










325 






Trp 


Glu 


Leu 


Gin 


Asp He- 


He 


Pro 








340 








Tyr 


Leu 


Phe 


Gly Trp Met 


Val 


Pro 














360 


Thr 


Gin 


Gly 


Glu 


Thr Leu 


Arg 


Lys 




J /U 








375 




Gin 


Asp 


Met 


Leu 


Val Pro 


Met 


Lys 










390 






Pne 


Gin 


Asn 


Asp 


He His 


Val 


Tyr 










405 






T Mil 

Leu 


Pro 


Ser 


Gin 


Pro Gly 


Leu 


Val 








420 








Leu 


Tyr 


T 1 ^ 

lie 


Asp 


He Gly 


Ala 


Tyr 














440 


Glu 


Ala 


Arg 


Ser 


Cys Met 


Arg 


Gin 




450 








455 




His 


Gly 


Phe 


Gin 


Met Leu 


Tyr Ala 


465 








470 






Phe 


Trp 


Glu 


Met 


Phe Asp 


Gly Ser 










485 






Leu 


Gly 


Cys 


Gin 


Asp Ala 


Phe 


Pro 








500 

















45 








val 


rtie 


Lys 


Leu 


Ser 


Ser 


Ala 


Pro 








60 










Asp 


JLie 


Gin 


Lys 


Gin 


Val 


Arg Glu 






75 










80 


Phe 


Met 


Cys 


Thr 


Gly Arg 


Pro Gly 




90 










ft c 

95 




Gly 


Lys 


Tyr 


Lys 


Lys 


Thr 


His 


Lys 


105 










110 






He 


Leu 


Glu 


Val 


Asp Thr 


Lys 


Lys 










125 








Val 


Thr 


Met 


Gly 


Gin 


Val 


1 nr 


Ala 








140 










Leu 


Pro 


Val 


Leu 


Pro 


Glu 


Leu 


Asp 






155 










160 


Met 


Gly Thr Gly 


He 


Glu 


Ser 


Ser 




170 










175 




His 


He 


Cys 


Thr 


Ala 


Tyr 


Glu 


Leu 


185 










190 






Arg 


Cys 


Thr 


Pro 


Ser 


Glu 


Asn 


Ser 










205 








Ser 


Cys 


Gly 


Thr 


Leu 


Gly 


Phe 


Leu 








220 










Pro 


Ala 


Lys 


Lys 


Tyr Val 


Lys 


Leu 






235 










240 


Glu 


Ala 


He 


Cys 


Ala 


Lys 


Phe 


Thr 




250 










255 




His 


Phe 


Val 


Glu 


Gly 


Leu 


Leu 


Tyr 


265 










270 






Thr 


Gly Val 


Met 


Thr Asp 


Glu 


Ala 










285 








Gly 


Asn 


Tyr 


Tyr 


Lys 


Pro 


Trp 


Phe 








300 










Lys 


Thr Asn Arg 


Glu Gly 


Leu 


Glu 






315 










320 


His 


Arg 


His 


Thr 


Arg 


Ser 


He 


Phe 




330 










335 




Phe 


Gly- Asn Asn 


Pro 


He 


Phe Arg 


345 










350 






Pro 


Lys 


He 


Ser 


Leu 


Leu 


Lys 


Leu 










365 








Leu 


Tyr 


Glu 


Gin 


His 


His 


Val 


Val 








380 










Cys 


Leu 


Gin 


Gin 


Ala 


Leu 


His 


Thr 






395 










400 


Pro 


He 


Trp 


Leu 


Cys 


Pro 


Phe 


He 




410 










415 




nlS 


Pro Lys Gly 


Asn 


Glu 


Ala 


Glu 


425 










430 






Gly 


Glu 


Pro 


Arg 


Val 


Lys 


His 


Phe 










445 








Leu 


Glu 


Lys 


Phe 


Val 


Arg 


Ser 


Val 








4 60 










Asp 


Cys 


Tyr Met 


Asn Arg 


Glu Glu 






475 










480 


Leu 


Tyr 


His 


Lys 


Leu Arg 


Glu Lys 




490 










4 95 




Glu 


Val 


Tyr Asp 


Lys 


He 


Cys 


Lys 


505 










510 
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Ala Ala Arg His 
515 



<210> 3 

<211> 2342 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gtcgacccac gcgtccggca gggccaacag tcacagcagc cctgaccaga gcattcctgg 60 
agctcaagct cctctacaaa gaggtggaca gagaagacag cagagaccat gggacccccc 120 
tcagcccctc cctgcagatt gcatgtcccc tggaaggagg tcctgctcac agcctcactt 180 
ctaaccttct ggaacccacc caccactgcc aagctcacta ttgaatccac gccgttcaat 240 
gtcgcagagg ggaaggaggt tcttctactc gcccacaacc tgccccagaa tcgtattggt 300 
tacagctggt acaaaggcga aagagtggat ggcaacagtc taattgtagg atatgtaata 360 
ggaactcaac aagctacccc agggcccgca tacagtggtc gagagacaat ataccccaat 420 
gcatccctgc tgatccagaa cgtcacccag aatgacacag gattctatac cctacaagtc 480 
ataaagtcag atcttgtgaa tgaagaagca accggacagt tccatgtata cccggagctg 540 
cccaagccct ccatctccag caacaactcc aaccccgtgg aggacaagga tgctgtggcc 600 
ttcacctgtg aacctgaggt tcagaacaca acctacctgt ggtgggtaaa tggtcagagc 660 
ctcccggtca gtcccaggct gcagctgtcc aatggcaaca tgaccctcac tctactcagc 720 
gtcaaaagga acgatgcagg atcctatgaa tgtgaaatac agaacccagc gagtgccaac 780 
cgcagtgacc cagtcaccct gaatgtcctc tatggcccag atggccccac catttccccc 840 
tcaaaggcca attaccgtcc aggggaaaat ctgaacctct cctgccacgc agcctctaac 900 
ccacctgcac agtactcttg gtttatcaat gggacgttcc agcaatccac acaagagctc 960 
tttatcccca acatcactgt gaataatagc ggatcctata tgtgccaagc ccataactca 1020 
gccactggcc tcaataggac cacagtcacg atgatcacag tctctggaag tgctcctgtc 1080 
ctctcagctg tggccaccgt cggcatcacg attggagtgc tggccagggt ggctctgata 1140 
tagcagccct ggtgtatttt cgatatttca ggaagactgg cagattggac cagaccctga 1200 
attcttctag ctcctccaat cccattttat cccatggaac cactaaaaac aaggtctgct 1260 
ctgctcctga agccctatat gctggagatg gacaactcaa tgaaaattta aagggaaaac 1320 
cctcaggcct gaggtgtgtg ccactcagag acttcaccta actagagaca ggcaaactgc 1380 
aaaccatggt gagaaattga cgacttcaca ctatggacag cttttcccaa gatgtcaaaa 1440 
caagactcct catcatgata aggctcttac ccccttttaa tttgtccttg cttatgcctg 1500 
cctctttcgc ttggcaggat gatgctgtca ttagtatttc acaagaagta gcttcagagg 1560 
gtaacttaac agagtatcag atctatcttg tcaatcccaa cgttttacat aaaataagag 1620 
atcctttagt gcacccagtg actgacatta gcagcatctt taacacagcc gtgtgttcaa 1680 
atgtacagtg gtccttttca gagttggact tctagactca cctgttctca ctccctgttt 1740 
taattcaacc cagccatgca atgccaaata atagaattgc tccctaccag ctgaacaggg ' 1800 
aggagtctgt gcagtttctg acacttgttg ttgaacatgg ctaaatacaa tgggtatcgc 1860 
tgagactaag ttgtagaaat taacaaatgt gctgcttggt taaaatggct acactcatct 1920 
gactcattct ttattctatt ttagttggtt tgtatcttgc ctaaggtgcg tagtccaact 1980 
cttggtatta ccctcctaat agtcatacta gtagtcatac tccctggtgt agtgtattct 2040 
ctaaaagctt taaatgtctg catgcagcca gccatcaaat agtgaatggt ctctctttgg 2100 
ctggaattac aaaactcaga gaaatgtgtc atcaggagaa catcataacc catgaaggat 2160 
aaaagcccca aatggtggta actgataata gcactaatgc tttaagattt ggtcacactc 2220 
tcacctaggt gagcgcattg agccagtggt gctaaatgct acatactcca actgaaatgt 2280 
taaggaagaa gatagatcca attaaaaaaa aaaaaaaaaa aaaaaaaaaa aagggcggcc 2340 
gc. 2342 

<210> 4 
<211> 344 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Gly Pro Pro Ser Ala Pro Pro Cys Arg Leu His Val Pro Trp Lys 
15 10 15 



wo 03/000012 



5 



PCT/US02/19773 



o J. u 


Val 


Leu 


Leu 


Thr 
X iix 


rixci 


OCX 


T.011 
XiCSU 


Leu 


Thr 


Phe 


Trp Asn 


Pro 


Pro 


Thr 




























30 






Thr 


Ala 


Lys 


Leu 


Thr 


He 


ox Ll 


Qor 

OCX 


Th-r 
X iix 


Pro 


Phe 


Asn 


Val 


Ala 


Glu Gly 






35 




















45 








Lys 


Glu 


Val 


Leu 


Leu 


Leu 


Ala 


His 


Asn 


Leu 


Pro 


Gin Asn Arg 


He Gly 
























fin 












Ser 


Trn 


Tvr 
1 yx 


Lys 


Gly 


ox u 


Arrr 
ni. ^ 


Va 1 


Asp Gly 




oer 


Leu 


He 


Val 


\j J 










70 




















80 




i yr 


V oix 


X X c 


Gly Thr 


oxn 


Gin 


Ala 


I nx 


Irx O 


ox y 


r ro 


ALB 


Tyr 


Ser 










85 




















95 








ox u 


Thr 

J. I IX 


He 


Tyr 


irx O 


Asn 


Ala 
105 


OCX 


Leu 


XiC u 


T 1 *3 
X Xc 


oxn 

110 
X X u 


Asn 


Val 


Thr 


Gin 


Asn 


Asp 


Thr Gly 


Phe 


Tyr 


Thr 


Leu 


oJ.n 


VClX 


X X c 


Lys 


Ser Asp 






lis 

X X □ 










120 










1 9 R 








Ij6U 


V CI X 


A Q r*i 


w X U 


Glu 


Ala 


Thr 


Gly Gin 


trne 


nxs 


vaX 


Tyr 


Pro 


Glu 


Leu 




X O V/ 










135 










X *x U 










Prn 




t X u 


OCX 


He 


Ser 


Ser 


Asn 


Asn 


Ser 


Asn 


tr X O 


vax 


Pin 


Asp 


Lys 












150 










155 










160 






Vox 


rlX d 


Phe 
165 


Thr 


Cys 


blU 


Pro 


blU 

170 


Val 


Gin 


Asn 


Thr 


Thr 
175 


Tyr 


L611 


±rp 


T T*ri 


Va 1 
V d X 


Asn Gly Gin 


Ser 


Leu 


Pro 


val 


Ser 


Pro 


Arg 


Leu 


Gin 








X o u 










185 










190 






L6U 


OC J. 


/*ioii 


ox y 


Asn 


Met 


Thr 


Leu 


Thr 


Leu 


Leu 


Ser 


Val 


Lys 


Arg Asn 






X 










200 










205 








Asp 


Ala 


Gly 


Ser 


Tyr Glu Cys 


Glu 


He 


Gin 


TV <-> 

Asn 


Pro 


Ala 


Ser 


Ala 


Asn 




91 n 

^ J. u 










215 










220 










Arg 


Ser 


Asp 


Pro 


Val 


Thr 


Leu 


Asn 


Val 


Leu 


Tyr 


Gly 


Pro 


Asp 


Gly 


Pro 












230 










235 










240 


Thr- 
1 IlX 




0€X 


Pro 


Ser 


Lys 


Ala 


Asn 


Tyr 


Arg 


Pro 


Gly Glu Asn 


Leu 


Asn 










245 










250 










255 








^ys 


9 fin 


Ala 


Ala 


Ser 


Asn 


Pro 

265 


Pro 


Ala 


Gin 


Tyr 


Ser 
270 


Trp 


Phe 


He 


Asn 


Gly 


Thr 


Phe 


Gin 


Gin 


Ser 
280 


Thr 


Gin 


Glu 


Leu 


Phe 
285 


He 


Pro 


Asn 


He 


Thr 


Val 


Asn 


Asn 


Ser Gly 


Ser 


Tyr 


Met 


Cys 


Gin 


Ala 


His 


Asn 


Ser 














295 










300 










Ala 


Thr 


Gly 


Leu 


Asn 


Arg 


Thr 


Thr 


Val 


Thr 


Met 


He 


Thr 


Val 


Ser 


Gly 


305 










310 










315 










320 


Ser 


Ala 


Pro 


Val 


Leu 
325 


Ser 


Ala 


Val 


Ala 


Thr 
330 


Val 


Gly 


He 


Thr 


He 
335 


Gly 


Val 


Leu 


Ala 


Arg 
340 


Val 


Ala 


Leu 


He 



















<210> 5 
<211> 2557 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 47, 2120, 2127, 2131, 2135; 2143, 2144, 2162, 2166, 2172, 
2186, 2192, 2200, 2219, 2246, 2265, 2375, 2376, 2377, 2411, 
2439, 2456, 2457, 2458, 2461, 2462, 2552, 2553, 2554, 2555, 
2556, 2557 ' 
<223> n = A,T,C or G 

<400> 5 

gggcgaaggg gcagccgcag cgcagaggcc cgccccgccc tcccctnccg tcacagccca 60 
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gccttccggc ccttgggctg ctcgcggcct ttttttcccg gctgggctcg ggctcagctc 120 

gactgggctc ggcgggcggc ggcggcggcg ccggcggctg gcggaggagg gagggcgagg 180 

gcgggcgcgg gccggcgggc gggcggaaga gggaggagag gcgcggggag ccaggcctcg 24 0 

gggcctcgga gcaaccaccc gagcagacgg agtacacgga gcagcggccc cggccccgcc 300 

aacgctgccg ccgggatgct ccagaccttg tatgattact tctggtggga acgtctgtgg 360 

ctgcctgtga acttgacctg ggccgatcta gaagaccgag atggacgtgt ctacgccaaa 420 

gcctcagatc tctatatcac gctgcccctg gccttgctct tcctcatcgt tcgatacttc 480 

tttgagctgt acgtggctac accactggct gccctcttga acataaagga gaaaactcgg 540 

ctgcgggcac ctcccaacgc caccttggaa catttctacc tgaccagtgg caagcagccc 600 

aagcaggtgg aagtagagct tttgtcccgg cagagcgggc tctctggccg ccaggtagag 660 

cgttggttcc gtcgccgccg caaccaggac cggcccagtc tcctcaagaa gttccgagaa 720 

gccagctgga gattcacatt ttacctgatt gccttcattg ccggcatggc cgtcattgtg 780 

gataaaccct ggttctatga catgaagaaa gtttgggagg gatatcccat acagagcact 840 

atcccttccc agtattggta ctacatgatt gaactttcct tctactggtc cctgctcttc 900 

agcattgcct ctgatgtcaa gcgaaaggat ttcaaggaac agatcatcca ccatgtggcc 960 

accatcattc tcatcagctt ttcctggttt gccaattaca tccgagctgg gactctaatc 1020 

atggctctgc atgactcttc cgattacctg ctggagtcag ccaagatgtt taactacgcg 1080 

ggatggaaga acacctgcaa caacatcttc atcgtcttcg ccattgtttt tatcatcacc 1140 

cgactggtca tcctgccctt ctggatcctg cattgcaccc tggtgtaccc actggagctc 1200 

tatcctgcct tctttggcta ttacttcttc aattccatga tgggagttct acagctgctg 1260 

catatcttct gggcctacct cattttgcgc atggcccaca agttcataac tggaaagctg 1320 

gtagaagatg aacgcagtga ccgggaagaa acagagagct cagaggggga ggaggctgca 1380 

gctgggggag gagcaaagag ccggccccta gccaatggcc accccatcct caataacaac 14 40 

catcgtaaga atgactgaac cattattcca gctgcctccc agattaatgc ataaagccaa 1500 

ggaactaccc cgctccctgc gctatagggt cactttaagc tctggggaaa aaggagaaag 1560 

tgagaggaga gttctctgca tcctccctcc ttgcttgtca cccagttgcc tttaaaccaa 1620 

attctaacca. gcctatcccc aggtaggggg acgttggtta tattctgtta gagggggacg 1680 

gtcgtatttt cctccctacc cgccaagtca tcctttctac tgcttttgag gccctccctc 1740 

agctctctgt gggtaggggt tacaattcac attccttatt ctgagaattt ggccccagct 1800 

gtttgccttt gactccctga cctccagagc cagggttgtg ccttattgtc ccatctgtgg 1860 

gcctcattct gccaaagctg gaccaaggct aacctttcta agctccctaa cttgggccag 1920 

aaaccaaagc tgagctttta actttctccc tctatgacac aaatgaattg agggtaggag 1980 

gagggtgcac ataaccctta ccctacctct gccaaaaagt gggggctgta ctggggactg 2040 

ctcggatgat ctttcttagt gctacttctt tcagctgtcc ctgtagcgac aggtctaaga 2100 

tctgactgcc tcctcctctn ctctggncct ncttnccccc ttnnccctct tctcttcagc 2160 

tnaggnctag cntggtttgg agtagnaatg gncaactaan ttctaatttt tatttattna 2220 

aatatttggg gttttggttt taaagnccag aattacggct agcancctag catttcagca 2280 

gagggaccat tttagaccaa aatgtactgt taatgggttt ttttttaaaa ttaaaagatt 2340 

aaataaaaaa tattaaataa aaaaaaaaaa taagnnncag actattagga attgagaagg 2400 

gggatcaact naaataaacg aagagagtct ttcttatgnm tgccttavma aaaaannncc 24 60 

nnacaaaaaa acgggggggg ggccttacaa attttaaaaa aaaaaccccc cccccccccc 2520 
cccggaaccg aaaaaaaaaa aaaagcccca annnnnn 2557 



<210> 6 
<211> 380 
<212> PRT 

<213> Homo sapiens 



<400> 6 














Met Leu 


Gin 


Thr 


Leu 


Tyr 


Asp Tyr 


1 






5 








Pro Val 


Asn 


Leu 
20 


Thr 


Trp 


Ala 


Asp 


Tyr Ala 


Lys 
35 


Ala 


Ser 


Asp 


Leu 


Tyr 
40 


Phe Leu 


He 


Val 


Arg 


Tyr 


Phe 


Phe 


50 










55 




Ala Ala 


Leu 


Leu 


Asn 


He 


Lys 


Glu 


65 








70 







Phe Trp Trp Glu Arg Leu Trp Leu 

10 15 
Leu Glu Asp Arg Asp Gly Arg Val 
25 30 
He Thr Leu Pro Leu Ala Leu Leu 
45 

Glu Leu Tyr Val Ala Thr Pro Leu 
60 

Lys Thr Arg Leu Arg Ala Pro Pro 
75 80 
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Asn 


Ala 


Thr 


Leu 


Glu 


His 


Phe 


Tyr 


Leu 










85 










Gin 


Val 


Glu 


Val 


Glu 


Leu 


Leu 


Ser 


Arg 








100 










105 


bin 


vai 


blU 


Arg 


Trp 


Phe 


Arg 


Arg 


Arg 






115 










120 




Leu 


Leu 


Lys 


Lys 


Phe 


Arg 


Glu 


Ala 


Ser 




130 










135 






lie 


Ala 


Phe 


lie 


Ala Gly 


Met 


Ala 


Val 


145 










150 








Tyr 


Asp 


Met 


Lys 


Lys 


Val 


Trp 


Glu 


Gly 










165 










Pro 


Ser 


Gin 


Tyr 


Trp Tyr 


Tyr 


Met 


He 








180 










185 


Leu 


Leu 


Phe 


Ser 


He 


Ala 


Ser 


Asp 


Val 






195 










200 




Gin 


lie 


lie 


His 


His 


Val 


Ala 


Thr 


He 




210 










215 






Phe 


Ala 


Asn 


Tyr 


He 


Arg 


Ala 


Gly 


Thr 


225 










230 








Ser 


Ser 


Asp 


Tyr 


Leu 


Leu 


Glu 


Ser 


Ala 










245 










Trp 


Lys 


Asn 


Thr 


Cys 


Asn 


Asn 


He 


Phe 








260 










265 


lie 


lie 


Thr 


Arg 


Leu 


Val 


He 


Leu 


Pro 






275 










280 




Leu 


Val 


Tyr 


Pro 


Leu 


Glu 


Leu 


Tyr 


Pro 




290 










295 






Phe 


Asn 


Ser 


Met 


Met 


Gly 


Val 


Leu 


Gin 


305 










310 








Tyr 


Leu 


lie 


Leu 


Arg 


Met 


Ala 


His 


Lys 










325 










Glu Asp 


Glu 


Arg 


Ser Asp 


Arg 


Glu 


Glu 








340 










345 


Glu 


Ala 


Ala 


Ala 


Gly 


Gly 


Gly 


Ala 


Lys 






355 










360 




His 


Pro 


lie 


Leu 


Asn 


Asn 


Asn 


His 


Arg 




370 










375 
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Thr 


Ser 


Gly 


Lys 


Gin 


Pro 


Lys 


90 










95 




Gin 


Ser 


Gly 


Leu 


Ser 


Gly Arg 










110 






Arg 


Asn 


Gin 


Asp 
125 


Arg 


Pro 


Ser 


Trp 


Arg 


Phe 
14 0 


Thr 


Phe 


Tyr 


Leu 


He 


Val 

ICC 

155 


Asp 


Lys 


Pro 


Trp 


Phe 
160 


Tyr 


Pro 


He 


Gin 


Ser 


Thr 


He 


170 










175 




Glu 


Leu 


Ser 


Phe 


Tyr 
190 


Trp 


Ser 


Lys 


Arg 


Lys 


Asp 

205 


Phe 


Lys 


Glu 


He 


Leu 


He 


Ser 


Phe 


Ser 


Trp 






220 








Leu 


He 


Met 


Ala 


Leu 


His Asp 




235 










240 


Lys 


Met 


Phe 


Asn 


Tyr 


Ala 


Gly 


250 










255 




He 


Val 


Phe 


Ala 


He 
270 


Val 


Phe 


Phe 


Trp 


He 


Leu 

285 


His 


Cys 


Thr 


Ala 


Phe 


Phe 
300 


Gly 


Tyr 


Tyr 


Phe. 


Leu 


Leu 


His 


He 


Phe 


Trp Ala 




315 










320 


rne 


He 


Thr 


Gly 


Lys 


Leu 


Val 


330 










335 




Thr 


Glu 


Ser 


Ser 


Glu 


Gly Glu 










350 






Ser 


Arg 


Pro 


Leu 


Ala 


Asn Gly 








365 








Lys 


Asn 


Asp 











380 



<210> 7 
<211> 1861 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> 1249 

<223> n = A, T,C or G 



<400> 7 

aagcgggcvc gagccgccgc gcgcgcgccg 

cccacgtctg cgttgctgcc ccgcctgggc 

tcagggaacc tccgccggag tcgaatttac 

atggtttgcg ggggcttcgc gtgttccaag 

accttggtta gtctgctgct aattggaatt 

tccagtctcc gagtggtcgg cgtggtcatt 

ttagtgggtc tgattggagc tgtaaaacat 

attctgttac ttgtatttat tgttcagttt 



cgcactgcag ccccaggccc cggcccccca 60 

cgggccccaa aggcaaggac aaagcagctg 120 

gtgcagctgc cggcaaccac aggttccaag 180 

aactgcctgt gcgccctcaa cctgctttac 240 

gctgcgtggg gcattggctt cgggctgatt 300 

gcagtgggca tcttcttgtt cctgattgct 360 

catcaggtgt tgctattttt ttatatgatt 420 

tctgtatctt gcgcttgttt agccctgaac 480 



wo 03/000012 



8 



PCT/US02/19773 



caggagcaac agggtcagct tctggaggtt ggttggaaca atacggcaag tgctcgaaat 540 

gacatccaga gaaatctaaa ctgctgtggg ttccgaagtg ttaacccaaa tgacacctgt 600 

ctggctagct gtgttaaaag tgaccactcg tgctcgccat gtgctccaat cataggagaa 660 

tatgctggag aggttttgag atttgttggt ggcattggcc tgttcttcag ttttacagag 720 

atcctgggtg tttggctgac ctacagatac aggaaccaga aagacccccg cgcgaatcct 780 

agtgcattcc tttgatgaga aaacaaggaa gatttccttt cgtattatga tcttgttcac 840 

tttctgtaat tttctgttaa gctccatttg ccagtttaag gaaggaaaca ctatctggaa 900 

aagtacctta ttgatagtgg aattatatat ttttactcta tgtttctcta catgtttttt 960 

tctttccgtt gctgaaaaat atttgaaact tgtggtctct gaagctcggt ggcacctgga 1020 

atttactgta ttcattgtcg ggcactgtcc actgtggcct ttcttagcat ttttacctgc 1080 

agaaaaactt tgtatggtac cactgtgttg gttatatggt gaatctgaac gtacatctca 1140 

ctggtataat tatatgtagc actgtgctgt gtagatagtt cctactggaa aaagagtgga 1200 

aatttattaa aatcagaaag tatgagatcc tgttatgtta agggaaatnc caaattccca 1260 

attttttttg gtctttttag gaaagatgtg ttgtggtaaa aagtgttagt ataaaaatga 1320 

taatttactt gtagtctttt atgattacac caatgtattc tagaaatagt tatgtcttag 1380 

gaaattgtgg tttaattttt gacttttaca ggtaagtgca aaggaaaagt ggtttcatga 1440 

aatgttctaa tgtataataa catttacctt cagcctccat ccagaatgga acggagtttt 1500 

gagtaatcca gggaagtata tctatatgat cttgatattg ttttataata atttgaagtc 1560 

taaaagactg catttttaaa caagttagta ttaatgcgtt ggcccacgta gcaaaaagat 1620 

atttgattat cttaaaaatt gttaaatacc gttttcatga aakttctcag tattgtaaca 1680 

gcaacttgtc aaacctaagc gatatttgaa tatgatctcc cataatttga aattgaaatc 1740 

gtattgtgtg gctctgtata ttctgttaaa aaattaaagg acagaaacct ttctttgtgt 1800 

atgcatgttt gaattaaaag aaagtaatgg aagaattgww mrawraaaaa aaaaaaaaaa 1860 
a 1861 

<210> 8 
<211> 204 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Val Cys Gly Gly Phe Ala Cys Ser Lys Asn Cys Leu Cys Ala Leu 

15 10 15 

Asn Leu Leu Tyr Thr Leu Val Ser Leu Leu . Leu lie Gly lie Ala Ala 

20 25 30 

Trp Gly lie Gly Phe Gly Leu lie Ser Ser Leu Arg Val Val Gly Val 

35 40 45 

Val He Ala Val Gly He Phe Leu Phe Leu He Ala- Leu Val Gly Leu 

50 55 60 

He Gly Ala Val Lys His His Gin Val Leu Leu Phe Phe Tyr Met He 
65 70 75 80 

He Leu Leu Leu Val Phe He Val Gin Phe Ser Val Ser Cys Ala Cys . 

85 90 95 

Leu Ala Leu Asn Gin Glu Gin Gin Gly Gin Leu Leu Glu Val Gly Trp 

100 105 110 

Asn Asn Thr Ala Ser Ala Arg Asn Asp He Gin Arg Asn Leu Asn Cys 

115 120 125 

Cys Gly Phe Arg Ser Val Asn Pro Asn Asp Thr Cys Leu Ala Ser Cys 

130 135 140 

Val Lys Ser Asp His Ser Cys Ser Pro Cys Ala Pro He He Gly Glu 
145 150 155 160 

Tyr Ala Gly Glu Val Leu Arg Phe Val Gly Gly He Gly Leu Phe Phe 

165 170 175 

Ser Phe Thr Glu He Leu Gly Val Trp Leu Thr Tyr Arg Tyr Arg Asn 

180 185 190 

Gin Lys Asp Pro Arg Ala Asn Pro Ser Ala Phe Leu 
195 200 



wo 03/000012 



9 



PCT/US02/19773 



<210> 9 

<211> 3579 

<212> DNA 

<213> Homo ' sapiens 

<220> 

<221> misc^f eature 
<222> 3350, 3546 
<223> n = A, T,C or G 

<400> 9 

ccgaaaaaat cttagtgtct gcaaaacagg tgggtttaag atttattgat attagggaaa 60 
gtgaaattaa tgagctactc aagtttctgt ctttggtcat cactgagagt ctatttccat 120 
aggaagaagt ctttgcagag gaaattgaat gctgtctgat gctacaattc atatggatcc 180 
tttcctggat tgaagcctga cttttaaaaa aggtttcaat aaattgctta tacctatgaa 240 
gagatgcaaa agaaccttca aaataaagca aaggacttga agaaagagaa ggaagacatg 300 
aagaagagga tgtcatttag gtgaagggcc atgctggata aggagctggc tgcctctgtg 360 
aacatcctac tcaaggcatc ttcactgctg tacatccttt tgaaatccca gagatcttca 420 
gtctccctgt ggattaagga gatgtgcagt atttaaagtg gcttcaggaa ggcatggaag 480 
aggactgagt ggggaaagct ttttgtgcat gctgctggct acctccagcg gctgcctcca 540 
gcctccatca gctgcactct ggggaagagg aggctgcctt ctacctccca gcatctctgg 600 
atttcatgtt cctgtcagca cagaggagct aaatggcctg tagaggctga aggtctgagg 660 
ctcctaaagc tggaagaaaa ggctgggcca gtcaggccaa gcaagaacac wrwrywwsty 720 
gcctgaagtg ccttccatgg ttaaaggggg cctaaagcag gccacaaagg gccatgaagg 780 
aatggttaat atgtaacaga ctgaagggga agaaagccag tgaagatgaa gacttgccca 840 
tcttccttga agtcagtaag gcctgcctca ggtgcctagg atgtaattgc tctgctgctt 900 
ctcatgggga ggagtggccc tcatgacctt gtttacctgg aagagtgtgg gatgaatgcc 960 
tcctcctatg gggactcgca agtgctttag caaaaggata aattgctaat tgtggcattt 1020 
cgtggatcag caggattatt tctccttgct aaagaggatt ttgttggtcc tgaattctga 1080 
ggaggtggga ctaggaatgg gctccatgag cctgtgtatg actcagggaa tattaggact 1140 
ttggcacagc ctcatgggtt gggagtaagt cttggctctt ccctagcctg aatgacagac 1200 
atcagatcat tctggtgctt tgtccatgaa gatgtagatt ctgagcccac ccaactaatc 1260 
ttttcacttg agcacagaaa cagccccggg aatcggacag acccgtgtct ttcaggtttg 1320 
cttcacagag ccccaggggt tgacaatagg tgccttggag actgcctgca tggggatttt 1380 
taaaaagctt tctttgttaa aggtttgtaa accactcctc tgagcctgtt ttcattttat 1440 
agattattca gggaactgaa ctgcacagag atccagaaag tgggtagtgc aggctgtagt 1500 
gctgataact actgtactac ttggatcttt gtgctcccaa ataccaaatg gaagaggatc 1560 
tctgagagtc ctttgcaaag atcttgtagg gactttaggc tggggccttc ggaaaattcc 1620 
agaggattcc aatggagatt ttgagggact gactcagaag aacaaagaga atgataatgg 1680 
tgatgtccct gctttttaca acagatcatg ttctgatata tatgcaaatc tgtgtaaagt 1740 
aaaccctacc taaaatgtac tggggaccca agatggactg cctgtattgc ttccaggata 1800 
aagtccaatt tctagctctg gtttttataa ccttgcttca gctcaccttt tccgtcatca 1860 
tcccctccat ctcctctccc acgctgggaa atggatggct gcactatact gtgtgatgtt 1920 
attgctatgt tcatgccatc ccctctgcct ggaatgccct tctgcatgaa tgcctgtgaa 1980 
atgttgttgc tcctttgtat ggcctggctt ccgtggttgg caggaatctc ttctttcgtg 2040 
gtattcctgt catctttgtg catcacagtc agctttgtat tcctagcttg taagctactt 2100 
gaggataggg gcatgtctga atctatttaa tctcttgcac ctgtttggca aattgatgtt 2160 
ttaagtattt aaataactaa agctctctct acagtacata ctcacttttg atttatgaat 2220 
tggcaaaatt caactttttt ccttgaatat tcttaaagtg agatgaattc caaaggagag 2280 
tgttctgtgt gtggccttca ttgagtggtt ttctgttacc agaaagctct tggtggcctt 2340 
cctcttccct ggtgtcaagg ttgactgtta taggaaatgg gaggggagag ggccgtttct 2400 
gccacgcatt gtcctaggtt cttaacatta tttaatcctt ataatgcaat gttatcctca 2460 
ttttacagat gaaacctgag accaaagaac atgtaacaca taaagtacat tgcagagtta 2520 
ggatgtgaac ccaactctga ttctaaacct aatgctctca ctctttcatt cagaggttca 2580 
gtcagttctt tgtaggctgt agatccagag aagctgccgt agccaacaat aaagttgtta 2640 
gtttttaaaa catctatgtg gtaagttggt ctggcactta aaaatgtatt gtttcccagg 2700 
cacggtggtt cacacctgta atcccagcat tttgggaggc cgaggcaggc ggatcattag 2760 
gtcaaaagat tgagaccatc ctgaccaaca tggtgaaacc ccgtctctac taaaagttac 2820 
aaaaattagc tgggtgtggt ggcgcatgcc tctagtccca gctacctggg aggctgaggc 2880 
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aggagaattg 
ctacagcctg 
tatatattgt 
ttaactccaa 
agaggctagg 
cttaggaaat 
aagtattgta 
aaagaatcca 
aggagcatag 
gccattacat 
tgatttgata 
tttggnaagc 



cttgaaccca 
gcaacaaagc 
ttactactca 
cttacatgtt 
gctaggacac 
gtatgttttt 
attattctgt 
acttcatcaa 
cagagagaaa 
ctgtatatct 
aacaacctac 
aaaataaaaa 



ggaggcagag 
gagactctgt 
ccacagatct 
ttggtcacta 
agaccagtgt 
tgaatccata 
gaatagtaac 
atattaacgt 
cggtcaccat 
ggccatatca 
catactttat 
aaaaaamaaa 



gttgcagtga 
ctaaaatata 
gcaggagttc 
ttacaaactg 
ttcccatgtg 
atccctagaa 
acttaccatt 
accgagttga 
ctcattagcc 
gctgctaatg 
acaaatctta 
aaaaaaaag 



gccaagatca 
tatatatata 
actgatctct 
tcatcccaga 
ggaattccct 
aaatcagttg 
atggagacat 
aggctacaan 
ctatttttgg 
gtgagttctt 
tggtgttccg 



tgctactgca 
tatatatata 
aggatctgcc 
atgatgctgc 
cccagtattt 
aggaaatgag 
cactagtttg 
gaactgagac 
ttgttgtgat 
gcaaacaaaa 
agaaataaac 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3579 



<210> 10 

<211> 79 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Asp Gly Cys Thr lie Leu Cys Asp Val lie Ala Met Phe Met Pro 

15 10 15 

Ser Pro Leu Pro Gly Met Pro Phe Cys Met Asn Ala Cys Glu Met Leu 

20 25 30 

Leu Leu Leu Cys Met Ala Trp Leu Pro Trp Leu Ala Gly He Ser Ser 

35 40 45 

Phe Val Val Phe Leu Ser Ser Leu Cys He Thr Val Ser Phe Val Phe 

50 55 60 

Leu Ala Cys Lys Leu Leu Glu Asp Arg Gly Met Ser Glu Ser He 
65 70 75 



<210> 11 

<211> 1076 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 708, 977, 1003, .1028, 1036 

<223> n = A, T,.C or G 



<400> 11 

tcgaccacgc 

tctcctccaa 

atcgcttttc 

tgcgggtaag 

ctggaaggcc 

gctgggcttc 

gtctgcgatc 

cctcggggct 

cacccacaag 

ttcttccttc 

tttttttttt 

acagatgcga 

tcaacctccc 

ctatcactct 

aaaaaaaaaa 

tgaacctgaa 

atggttacaa 



gtccggtgcc 
ggtctagtga 
ttgtgctacc 
gatgcaggta 
ctgaccttca 
accggcgccg 
ctgaatgggg 
ggtggcagca 
tatctcgata 
ttggcctaac 
ttgagatggg 
acatagtaca 
aagtaggatt 
ccccaacaac 
aagggcggcc 
acataaaatg 
ataaagncaa 



catctatcag 
cggagcccgc 
tgctgctctt 
agaaaaagtg 
tggccgtcgg 
gcatcgcggc 
gcggcgtgcc 
gcgtcgtcat 
gtgaggagga 
tcttccagtt 
ttctcactat 
ctgcagcctc 
acaagcatgc 
ctagatgtga 
gctagactag 
aatgcaattg 
ttagcatcac 



caggctccgg 
gcgcggcgcc 
cacttgcagt 
ctcggagagc 
aggaggactc 
caactcggtg 
cgccgggggg 
aggtaatatt 
tgaggagtag 
aggatctaga 
attgtccagg 
caactcctag 
gccgacgatg 
aaacagaata 
tctagagaaa 
ttgttgttaa 
aaatttcaca 



gctgaagatt 
accatgcggc 
ggggtggagg 
tcggacagcg 
gcagtcgccg 
gctgcctcgc 
ctagtggcca 
ggtgccctga 
ccagcagctc 
actttgcctt 
ctagagtgca 
cctcaggnga 
cccagaatcc 
aacttcaccc 
aaacctccca 
cttgtttatt 
aanaaaggca 



gcttctcttc 
agaaggcggt 
caggtgagaa 
gctccgggtt 
ggctgcccgc 
tgatgagctg 
cgctgcagag 
tgggctacgc 
ccagaacctc 
tttttttttt 
gtggctattc 
tcctcctgtc 
agaactttgt 
agaaaacaaa 
cacctccccc 
gcagcttata 
tttttttcac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 



wo 03/000012 



11 



PCT/US02/19773 



tgcattcnta gttggngggt ttggtccaaa actcatcaaa tggtatcttt atcatg 107 6 

<210> 12 
<211> 138 
<212> PRT 

<213> Homo sapiens 

<400> 12 



Met 


Arg 




ijys 


Ala 


Val 


Ser 


Leu 


rfie 


Leu 


Cys 


Tyr 


Leu 


Leu Leu 


Phe 


1 








5 


















15 




Thr Cys 


OCT J. 


o J. y 


Val 


Glu 


Ala 


Gly 




7\ O Tt 

Moll 


M±a 


\j±y 


Lys 


Asp Ala 


Gly 








20 










25 










30 




Lys 


Lys 


Lys 


Cys 


Ser 


Glu 


Ser 


Ser 


Asp 


Ser 


Gly 


Ser 


Gly 


Phe Trp 


Lys 






35 










40 










45 


Ala 


Leu 


Thr 


Phe 


Met 


Ala 


Val 


Gly 


Gly 


Gly 


Leu 


Ala 


Val 


Ala Gly 


Leu 




50 










55 










60 






Pro 


Ala 


Leu 


Gly 


Phe 


Thr 


Gly 


Ala 


Gly 


He 


Ala 


Ala 


Asn 


Ser Val 


Ala 


65 










70 










75 








80 


Ala 


Ser 


Leu 


Met 


Ser 


Trp 


Ser 


Ala 


He 


Leu 


Asn 


Gly 


Gly 


Gly Val 


Pro 










85 










90 








95 




Ala 


Gly 


Gly 


Leu 


Val 


Ala 


Thr 


Leu 


Gin 


Ser 


Leu 


Gly 


Ala 


Gly. Gly 


Ser 








100 










105 










110 




Sex 


Val 


Val 


lie 


Gly Asn 


He 


Gly 


Ala 


Leu 


Met 


Gly 


Tyr 


Ala Thr 


His 






115 










120 










125. 






Lys 


Tyr 


Leu 


Asp 


Ser 


Glu 


Glu Asp 


Glu 


Glu 













130 135 



<210> 13 
<211> 1352 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 1139, 1140, .1150, 1155, 1166, 1171, 1181, 1186, 1189, 1212, 
1214, 1252, 1311 
<223> n = A, T,C or G 

<400> 13 

tcaccacgcg tccggaagct ccgggtgtcg cgggggcggg aggaattaag ggagggagag 60 
aggcgcgcgg gtgaaaggcg cattgatgca gcctgcggcg gcctcggagc gcggcggagc 120 
cagacgctga ccacgttcct ctcctcggtc tcctccgcct ccagctccgc gctgcccggc 180 
agccgggagc catgcgaccc cagggccccg ccgcctcccc gcagcggctc cgcggcctcc 240 
tgctgctcct gctgctgcag ctgcccgcgc cgtcgagcgc ctctgagatc cccaagggga 300 
agcaaaaggc gcagctccgg cagagggagg tggtggacct gtataatgga atgtgcttac 360 
aagggccagc aggagtgcct ggtcgagacg ggagccctgg ggccaatggc attccgggta 420 
cacctgggat cccaggtcgg gatggattca aaggagaaaa gggggaatgt ctgagggaaa 4 80 
gctttgagga gtcctggaca cccaactaca agcagtgttc atggagttca ttgaattatg 540 
gcataaatct tgggaaaatt gcggagtgta catttacaaa gatgcgttca aatagtgctc 600 
taagagtttt gttcagtggc tcacttcggc taaaatgcag aaatgcatgc tgtcagcgtt 660 
ggtatttcac attcaatgga gctgaatgtt caggacctct tcccattgaa gctataattt 720 
atttggacca aggaagccct gaaatgaatt caacaattaa tattcatcgc acttcttctg 780 
tggaaggact ttgtgaagga attggtgctg gattagtgga tgttgctatc tgggttggca 840 
cttgttcaga ttacccaaaa ggagatgctt ctactggatg gaattcagtt tctcgcatca 900 
ttattgaaga actaccaaaa taaatgcttt aattttcatt tgctacctct ttttttatta 960 
tgccttggaa tggttcactt aaatgacatt ttaaataagt ttatgtatac atctgaatga 1020 
aaagcaaagc taaatatgtt tacagaccaa agtgtgattt ccccctgttt ttaaatctag 1080 
cattattcat tttgcttcaa tcaaaagtgg tttcaatatt ttttttagtt ggttagaann 1140 
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ctttcttcan agtcncattc tctcanccta naatttggaa nattgntgng gtcttttgtt 1200 

ttttctctta gnanagcatt tttaaaaaaa tataaaagct accaatcttt gnacaatttg 1260 

taaatgttaa gaattttttt tatatctgtt aaataaaaat tatttccacc naaaaaaaaa 1320 

aaaaaaaaaa aaaaaaaaaa gggcggccgc ta 1352 



<210> 14 
<211> 243 
<212> PRT 

<213> Homo sapiens 



<400> 14 



Met 


Arg 


Pro 


bin 


Gly 


Pro 


Ala 


Ala 


1 








c 
D 








Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Gin 


Leu 


















lie 


Pro Lys 


Gly 


Lys 


Gin 


Lys 


Ala 






35 












Asp 


Leu 


Tyr 


Asn 


Gly 


Met 


Cys 


Leu 




50 










Ob 




Arg Asp Gly 


Ser 


Pro 


Gly 


Ala 


Asn 


65 










1 0 






Pro 


Gly Arg 


Asp 


Gly 


Phe 


Lys 


Gly 










85 








Ser 


Phe 


Glu 


Glu 


Ser 


Trp 


Thr 


Pro 








lUU 










Ser 


Leu 


Asn 


Tyr 


Gly 


He 


Asn 


Leu 






115 










120 


Thr 


Lys 


Met 


Arg 


Ser 


Asn 


Ser 


Ala 




130 










135 




Leu 


Arg 


Leu 


Lys 


Cys 


Arg 


Asn 


Ala 


145 










150 






Phe 


Asn 


Gly 


Ala 


Glu 


Cys 


Ser 


Gly 










165 








Tyr 


Leu Asp 


Gin 


Gly 


Ser 


Pro 


Glu 








180 










Arg 


Thr 


Ser 


Ser 


Val 


Glu 


Gly 


Leu 






195 










200 


Val 


Asp 


Val 


Ala 


He 


Trp 


Val 


Gly 




210 










215 




Asp Ala 


Ser 


Thr 


Gly 


Trp 


Asn 


Ser 


225 










230 






Leu 


Pro 


Lys 













C <^ -K- 

ber 


Pro 


bin 


Arg 


Leu 


Arg 


biy 


Leu 




1 n 










1 D 




Pro 


Ala 


Pro 


ber 


Ser 


Ala 


Ser 


Glu 












oU 






bin 


Leu 


Arg 


bin 


Arg 


Glu 


Val 


Val 










4 5 








Gin 


Gly 


Pro 


Ala 


Gly 


Val 


Pro 


Gly 








ou 










Gly 


He 


Pro 


Gly 


Thr 


Pro 


Gly 


He 






/ D 










Q C\ 

oU 


Glu 


Lys 


Gly 


Glu 


Cys 


Leu 


Arg 


Glu 




90 










95 




Asn 


Tyr 


Lys 


Gin 


Cys 


Ser 


Trp 


Ser 


105 










110 






Gly 


Lys 


He 


Ala 


Glu 


Cys 


Thr 


Phe 










125 








Leu 


Arg 


Val 


Leu 


Phe 


Ser 


Gly 


Ser 








140 










Cys 


Cys 


Gin 


Arg 


Trp 


Tyr 


Phe 


Thr 






155 










160 


Pro 


Leu 


Pro 


He 


Glu 


Ala 


He 


He 




170 










175 




Met 


Asn 


Ser 


Thr 


He 


Asn 


He 


His 


185 










190 






Cys 


Glu 


Gly 


He 


Gly 


Ala 


Gly 


Leu 










205 








Thr 


Cys 


Ser 


Asp 


Tyr 


Pro 


Lys 


Gly 








220 










Val 


Ser 


Arg 


He 


He 


He 


Glu 


Glu 






235 










240 



<210> 15 

<211> 2142 

<212> DNA 

<213> Homo sapiens 

<400> 15 

ggagtcgacc cacgcgtccg 
cctctggcca ctcagccggg 
ccggcgggca gggcggggca 
cgggcctctc ccttctgccy 
tggggctccc gcgcgggtcg 
ggctcctctt tgccctgtac 
ccagggacac cacatcattt 
gaaagggcca ggcctgccga 



ccccggggga cccgccgccc 
gccgagaggg agctgccggg 
gggcaagacg gccgcctccg 
srgrcgtcag cggacsgggc 
ttcttctggc tgctgctcct 
ttctcggcgg tgcagcggta 
gaagcattct ttcaatccaa 
cacctgcttc acctggccat 



agctcccgag ggtgcggcag 60 
cggggaggcg ccgcaggcac 120 
caagtgccac ccggcccacc 180 
gctcgcgggc cggggctgta 240 
gctcacggct gcctgctcgg 300 
cccggggcca gcggccggag 360 
ggcatcgaat tcttggacag 420 
tcagcggcac ccccacttcc 480 
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gtggcctgtt caatctctcc attccagtgc tgctgtgggg ggacctcttc accccagcgc 540 
tctgggaccg cctgagccaa cacaaagccc cgtatggctg gcgggggctc tctcaccaag 600 
tcatcgcctc caccctgagc cttctgaacg gctcagagag tgccaagctg tttgccccgc 660 
ccagggacac ccctccaaag tgtatccggt gtgccgtggt gggcaacgga ggcattctga 720 
atgggtcccg ccagggtccc aacatcgatg cccatgacta tgtattcaga ctcaatggag 780 
ctgtgatcaa aggcttcgag cgcgatgtgg gcaccaagac ttccttctat ggtttcactg 840 
tgaacacgat gaagaactcc ctcgtctcct actggaatct gggcttcacc tccgtgccac 900 
aaggacagga cctgcagtat atcttcatcc cctcagacat ccgcgactat gtgatgctga 960 
gatcggccat tctgggcgtg cctgtccctg agggcctaga taaaggggac aggccgcacg 1020 
cctattttgg accagaagcc tctgccagta aattcaagct gctacatccg gacttcatca 1080 
gctacctgac agaaaggttc ttgaaatcaa agttgattaa cacacatttt ggagacctat 1140 
atatgcctag taccggggct ctcatgctgc tgacagcttt gcatacctgt gaccaggtca 1200 
gtgcctatgg attcatcaca agcaactact ggaaattttc cgaccactat ttcgaacgaa 1260 
aaatgaagcc attgatattt tatgcaaacc acgatctgtc cctggaagct gccctgtgga 1320 
gggacctgca caaggccggc atccttcagc tgtaccagcg ctgaccccaa tgcactgagc 1380 
cctttgcttc ttcaagagtt gcggcctgat cctctcaagt ggccaaaagc ttttttaact 1440 
tttcaatctt caccttccct tgccaacaga gggcactggg gtgaattcaa gattttcatc 1500 
gaggtctgtt caatatagga caccccagct tgtccttggc tcatccaaga actcttctgt 1560 
atctaaaaca atacatctca atcttggcca agggaaaacg gactgctttg ctggattggc 1620 
actgagcaac tttaggaaat gtcggtggag tgttcagcaa gatcagacag cagtccaggt 1680 
caaaggcaaa cacacacgct ccagcccaaa tcctcctggt ggcacatcct accccagatg 1740 
ctaaagtgat tcaaggactc caggacacct cttaagagcc tttctaagaa catgataggc 1800 
ttacttctgc tccataataa agtgggagaa aaaagccaga atataaaact taaractaga 1860 
taactgcgya satgatggac catttttttt ttttggctgg gtagagaaat catataaaac 1920 
gcaggctgtt tagcatggag atgactctca gaacactggr agggtctggc acttgatggg 1980 
ggttagttgc ttggcagcct gcctgaagtc ccattagaga tgtatcaccc ccttgtcacc 2040 
aacaggatga tgtccccagg taataaacct tcatcctcat aaaaaaaaaa aaaaaaaaaa 2100 
aaaaaaaaaa aaaaaaaaaa aagggcggcc gctagactag tc 2142 



<210> 16 

<211> 374 
<212> PRT 

<213> Homo sapiens 



<400> 16 



Met 


Gly 


Leu 


Pro 


1 








Ala 


Ala 


Cys 


Ser 








20 


Arg 


Tyr 


Pro 


Gly 






35 




Ala 


Phe 


Phe 


Gin 




50 






Ala 


Cys 


Arg 


His 


65 








Arg 


Gly 


Leu 


Phe 


Phe 


Thr 


Pro 


Ala 








100 


Gly 


Trp 


Arg 


Gly 






115 




Leu 


Asn 


Gly 


Ser 




130 






Pro 


Pro 


Lys 


Cys 


145 








Asn 


Gly 


Ser 


Arg 


Arg 


Leu 


Asn 


Gly 








180 



Arg Gly Ser Phe 
5 

Gly Leu Leu Phe 

Pro Ala Ala Gly 
40 

Ser Lys Ala Ser 
55 

Leu Leu His Leu 
70 

Asn Leu Ser lie 
85 

Leu Trp Asp Arg 

Leu Ser His Gin 
120 

Glu Ser Ala Lys 
135 

He Arg Cys Ala 
150 

Gin Gly Pro Asn 
165 

Ala Val He Lys 



Phe 


Trp 


Leu 


Leu 




10 






Ala 


Leu 


Tyr 


Phe 


25 








Ala 


Arg 


Asp 


Thr 


Asn 


Ser 


Trp 


Thr 








60 


Ala 


He 


Gin 


Arg 






75 




Pro 


Val 


Leu 


Leu 




90 






Leu 


Ser 


Gin 


His 


105 








Val 


He 


Ala 


Ser 


Leu 


Phe 


Ala 


Pro 








140 


Val 


Val 


Gly 


Asn 






155 




He 


Asp 


Ala 


His 




170 






Gly 


Phe 


Glu 


Arg 



Leu Leu Leu Thr 
15 

Ser Ala Val Gin 
30 

Thr Ser Phe Glu 
45 

Gly Lys Gly Gin 

His Pro His Phe 
80 

Trp Gly Asp Leu 
95 

Lys Ala Pro Tyr 
110 

Thr Leu Ser Leu 

125 

Pro Arg Asp Thr 

Gly Gly He Leu 
160 

Asp Tyr Val Phe 
175 

Asp Val Gly Thr 
190 
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Lys 


Thr 


Ser 

1 yD 


Phe 


Tyr 


Gly 


Phe 


Thr 


Val 


Asn 


Thr 


Met 


Lys 

one 
205 


Asn 


Ser 


Leu 


Val 


Ser 
210 


1*1/ V 

iyr 


irp 


Asn 


Leu 


vjxy 

^ ID 


irne 


1 nr 


oer 


vai 


Pro 
ZZU 


vjxn 


Caiy 


bin 


TV 

ASp 


Leu 


Gin 


Tyr 


lie 


Phe 


He 


Pro 


Ser 


Asp 


He 


Arg 


Asp 


Tyr 


Val 


Met 


Leu 


225 










^ JU 










9 R 










O /I A 


Arg 


Ser 


Ala 


He 


Leu 


Gly 


Val 


Pro 


Val 


Pro 

0 c: n 


Glu 


Gly 


Leu 


Asp 


Lys 
o c c 


Gly 


Asp Arg 


rro 


nXS 


AX a 


Tyr 


Q Vx r-v 

rne 


biy 


Pro 




n 1 a 

/ixa 


Ser 


Ala 


Ser 


i»ys 


rne 


















Zoo 










OTA 






Lys 


Leu 


Leu 


His 


Pro 


Asp 


Phe 


He 
0 Q n 


Ser 


Tyr 


Leu 


Thr 


Glu 

O Q R 


Arg 


Phe 


Leu 


Lys 


Ser 
290 


Lys 


Leu 


He 


Asn 


Thr 


His 


Phe 


Gly 


Asp 


Leu 
"5 A 


Tyr 


Met 


Pro 


Ser 


Thr Gly 


Ala 


Leu 


Met 


Leu 


Leu 


Thr 


Ala 


Leu 


His 


Thr 


Cys 


Asp 


Gin 


Val 


305 




















Jib 










320 


Ser 


Ala 


T \/ -r 

1 yr 


o JL y 


325 


± ± 6 


J. niT 




n c r» 


i yr 
330 


1 rp 


Lys 


tr ne 


oer 


Asp 
335 


his 


Tyr 


Phe 


Glu 


Arg 
340 


Lys 


Met 


Lys 


Pro 


Leu 
345 


He 


Phe 


Tyr 


Ala 


Asn 
350 


His 


Asp 


Leu 


Ser 


Leu 
355 


Glu 


Ala 


Ala 


Leu 


Trp 
360 


Arg 


Asp 


Leu 


His 


Lys 
365 


Ala 


Gly 


He 


Leu 


Gin 


Leu 


Tyr 


Gin 


Arg 























370 



<210> 17 

<211> 1707 

<212> DNA 

<213> Homo sapiens 

<400> 17 

tacttaggga gtcgaccacg 
ccaggtgttg gaggcctttg 
ccgccgccac ggctacagaa 
acccgccgat tgtggcctcc 
agcgctggca ggcgacggtg 
ccaagatgca gtttatgaag 
gctggtacca gtacttcacc 
agcccgagcc cgagcccgaa 
cggtcgcctg cgactgcctg 
accgttacga ggagagcgag 
tcgtgggcct cctcagccca 
gcccagttca tttttactgg 
gtcgaattga tgacttgcga 
ccaagcaact cgcagagttt 
taaaatgtaa accagacaaa 
ttggctcaaa aactgcagat 
acaaattaag aagggaaagg 
ttagagctaa tgctattgca 
gattctggag tgaagcagat 
gaaaatactt ttcctttttc 
atcaaaataa ccctcgtaaa 
caattgagga taatgatgtg 
ttctactgaa tagaccaaaa 
ttgattgaga actgtgggaa 
taactgtcaa ctattaaata 
attagatctc ttactctgct 
ttgatttaaa aataattgca 



cgtccgacta gttctagatc 
ctacgcggtc cgaggctttc 
acgacctgcc aagacgtcgc 
atgacagccg acagcaaagc 
cacgctgcgg agtcggtaga 
tacatggttt acccgcagac 
aagaccgtgt tcctgtcggg 
cccgaacctg aacctgcgct 
ctgcaggagc acttctacct 
gtcatatctt tgcccttcct 
cacaacccgg ccctggccgc 
gtgcgtggtg aagaaattat 
taccagatag atgataaacc 
gtgccattgg attattctgt 
cttccattat tcaaacggca 
ccttgctgtt acggtcacac 
cttttgagac aaaactgtgc 
agcctttttg cttggactgg 
gttactcgac cttttgtctc 
tgctaccagc taaatacttt 
aatatatgtt ggggtacaca 
aaaggtttta atgatgatgt 
gaagaaaaat cacagctgtt 
tatttaaatt ttactgaagg 
cattgatttt tgagacaaat 
caaattcatc actgaaagat 
tttgtatatt gctaactgat 



gcgggcaaag atggcggcgg 60 
attgcacacc gcggctaatg 120 
ggcgaccccc gtcgcgcggt 180 
tgcacggctg cggcggatcg 240 
cgagaagctg cgaatcctca 300 
cttcgcgctg aatgccgacc 360 
tctgccgccg cccccagcgg 420 
ggacctcgcg gcgctgcgtg 4 80 
gcggcgcagg cggcgcgtgc 540 
ggatcagctg gtgtcaaccc 600 
tgccgccctc gattatagat 660 
tcctcgtggt catcgaagag 720 
aaacaaccag attcgaatat 780 
tcctatagaa atccccacta 840 
gtatgaaaac cacatatttg 900 
ccagtttcat ctgttacctg 960 
tgatcagata gaagttgttt 1020 
agcacaagct atgtatcaag 1080 
ccaggctgtg atcacagatg 1140 
ggcactgact acacaagctg 1200 
aagtaagcct ctttatgaaa 1260 
tctacttcag atagttcact 1320 
ggaaaactga aaaagcatat 1380 
aacaataatg atgagatttg 1440 
aaaaaaaatg tcaacctgtt 1500 
ttaattttag ttaccttttg 1560 
aagacaaatt gagttattga 1620 
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gctattaaat gcacatttta atataaatgc agaaatccca aataaaatgc taacatactg 1680 
aattcagtaa ttaaaagaac ccactgc 1707 

<210> 18 
<211> 439 
<212> PRT 

<213> Homo sapiens 



<400> 18 



Met 


Ala 


Ala 


Ala 


Arg 


Cys 


Trp 


Arg 


Pro 


Leu 


Leu 


Arg 


Gly 


Pro 


Arg 


Leu 


1 








5 










10 










15 




Ser 


Leu 


His 


Thr 


Ala 


Ala 


Asn 


Ala 


Ala 


Ala 


Thr 


Ala 


Thr 


Glu 


Thr 


Thr 








20 










25 










30 






Cys 


Gin 


Asp 


Val 


Ala 


Ala 


Thr 


Pro 


Val 


Ala 


Arg 


Tyr 


Pro 


Pro 


He 


Val 






35 










40 










45 








Ala 


Ser 


Met 


Thr 


Ala 


Asp 


Ser 


Lys 


Ala 


Ala 


Arg 


Leu 


Arg 


Arg 


He 


Glu 




50 










55 










60 










Arg 


Trp 


Gin 


Ala 


Thr 


Val 


His 


Ala 


Ala 


Glu 


Ser 


Val 


Asp 


Glu 


Lys 


Leu 


65 










70 










75 










80 


Arg 


He 


Leu 


Thr 


Lys 


Met 


Gin 


Phe 


Met 


Lys 


Tyr 


Met 


Val 


Tyr 


Pro 


Gin 










85 










90 










95 




Thr 


Phe 


Ala 


Leu 


Asn 


Ala 


Asp 


Arg 


Trp 


Tyr 


Gin 


Tyr 


Phe 


Thr 


Lys 


Thr 








100 










105 










110 






Val 


Phe 


Leu 


Ser 


Gly 


Leu 


Pro 


Pro 


Pro 


Pro 


Ala 


Glu 


Pro 


Glu 


Pro 


Glu 






115 










120 










125 








Pro 


Glu 


Pro 


Glu 


Pro 


Glu 


Pro 


Ala 


Leu 


Asp 


Leu 


Ala 


Ala 


Leu 


Arg 


Ala 




130 










135 










140 










Val 


Ala 


Cys 


Asp 


Cys 


Leu 


Leu 


Gin 


Glu 


His 


Phe 


Tyr 


Leu 


Arg 


Arg 


Arg 


145 










150 










155 










160 


Arg 


Arg 


Val 


His 


Arg 


Tyr 


Glu 


Glu 


Ser 


Glu 


Val 


He 


Ser 


Leu 


Pro 


Phe 










165 










170 










17 5 




Leu Asp 


Gin 


Leu 


Val 


Ser 


Thr 


Leu 


Val 


Gly 


Leu 


Leu 


Ser 


Pro 


His 


Asn 








180 










185 










190 






Pro 


Ala 


Leu 


Ala 


Ala 


Ala 


Ala 


Leu 


Asp 


Tyr 


Arg 


Cys 


Pro 


Val 


His 


Phe 






195 










200 










205 








Tyr 


Trp 


Val 


Arg 


Gly 


Glu 


Glu 


He 


He 


Pro 


Arg 


Gly 


His 


Arg 


Arg 


Gly 




210 










215 










o 1 r\ 

220 










Arg 


He 


Asp 


Asp 


Leu 


Arg 


Tyr 


Gin 


He 


Asp 


Asp 


Lys 


Pro 


Asn 


Asn 


Gin 


225 










OTA 

230 










235 










24U 


He 


Arg 


He 


Ser 


Lys 


Gin 


Leu 


Ala 


Glu 


Pne 


Val 


Pro 


Leu 


Asp 


Tyr 


Ser 










245 










o c r\ 

250 










o c; c 
255 




Val 


Pro 


He 


Glu 


He 


Pro 


Thr 


He 


Lys 


Cys 


Lys 


Pro 


Asp 


Lys 


Leu 


Pro 








260 










265 










OTA 
2 fU 






Leu 


Phe 


Lys 


Arg 


Gin 


Tyr 


Glu 


Asn 


His 


T 1 >^ 

He 


Pne 


Val 


Gly 


C> A 

Ser 


Lys 


Tnr 






275 










280 










285 








Ala Asp 


Pro 


Cys 


Cys 


Tyr 


Gly 


His 


Thr 


Gin 


Phe 


His 


Leu 


Leu 


Pro 


Asp 




290 










295 










300 










Lys 


Leu 


Arg 


Arg 


Glu 


Arg 


Leu 


Leu 


Arg 


Gin 


Asn 


Cys 


Ala 


Asp 


Gin 


He 


305 










310 










315 










320 


Glu 


Val 


Val 


Phe 


Arg 


Ala 


Asn 


Ala 


He 


Ala 


Ser 


Leu 


Phe 


Ala 


Trp 


Thr 










325 










330 










335 




Gly Ala 


Gin 


Ala 


Met 


Tyr 


Gin 


Gly 


Phe 


Trp 


Ser 


Glu 


Ala 


Asp 


Val 


Thr 








340 










345 










350 






Arg 


Pro 


Phe 


Val 


Ser 


Gin 


Ala 


Val 


He 


Thr 


Asp 


Gly 


Lys 


Tyr 


Phe 


Ser 






355 










360 










365 








Phe 


Phe 


Cys 


Tyr 


Gin 


Leu 


Asn 


Thr 


Leu 


Ala 


Leu 


Thr 


Thr 


Gin 


Ala 


Asp 




370 










375 










380 










Gin 


Asn 


Asn 


Pro 


Arg 


Lys 


Asn 


He 


Cys 


Trp 


Gly 


Thr 


Gin 


Ser 


Lys 


Pro 


385 










390 










395 










400 
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Leu Tyr Glu Thr 

Val Leu Leu Gin 
420 

Lys Ser Gin Leu 
435 



lie Glu Asp Asn 

405 

lie Val His Phe 
Leu Glu Asn 



Asp Val Lys Gly 
410 

Leu Leu Asn Arg 
425 



Phe Asn Asp Asp 
415 

Pro Lys Glu Glu 
430 



<210> 19 

<211> 2844 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> 767, 2839, 2842 
<223> n = A,T,C or G 

<400> 19 

cgacccacgc cgtccgggcg gcggcgtccg caggagcccg ggaggcggag gcgggaggcg 60 

gcggcggcgc gcggagacgc agcagcggca gcggcagcat gtcggccggc ggagcgtcag 120 

tcccgccgcc cccgaacccc gccgtgtcct tcccgccgcc ccgggtcacc ctgcccgccg 180 

gccccgacat cctgcggacc tactcgggcg ccttcgtctg cctggagatt ctgttcgggg 240 

gtcttgtctg gattttggtt gcctcctcca atgttcctct acctctacta caaggatggg 300 

tcatgtttgt gtccgtgaca gcgtttttct tttcgctcct ctttctgggc atgttcctct 360 

ctggcatggt ggctcaaatt gatgctaact ggaacttcct ggattttgcc taccatttta 420 

cagtatttgt cttctatttt ggagcctttt tattggaagc agcagccaca tccctgcatg 480 

atttgcattg caatacaacc ataaccgggc agccactcct gagtgataac cagtataaca 54 0 

taaacgtagc agcctcaatt tttgccttta tgacgacagc ttgttatggt tgcagtttgg 600 

gtctggcttt acgaagatgg cgaccgtaac actccttaga aactggcagt cgtatgttag 660 

tttcacttgt ctactttata tgtctgatca atttggatac cattttgtcc agatgcaaaa 720 

acattccaaa agtaatgtgt ttagtagaga gagactctaa gctcaangtt ctggtttatt 780 

tcatggatgg aatgttaatt ttattatgat attaaagaaa tggcctttta ttttacatct 840 

ctcccctttt tccctttccc cctttatttt cctccttttc tttctgaaag tttcctttta 900 

tgtccataaa atacaaatat' attgttcata aaaaattagt atcccttttg tttggttgct 960 

gagtcacctg aaccttaatt ttaattggta attacagccc ctaaaaaaaa cacatttcaa 1020 

ataggcttcc cactaaactc tatattttag tgtaaaccag gaattggcac acttttttta 1080 

gaatgggcca gatggtaaat atttatgctt cacggtccat acagtctctg tcacaactat 114 0 

tcagttctgc tagtatagcg tgaaagcagc tatacacaat acagaaatga atgagtgtgg 1200 

ttatgttcta ataaaactta tttataaaaa caaggggagg ctgggtttag cctgtgggcc 1260 

atagtttgtc aaccactggt gtaaaacctt agttatatat gatctgcatt ttcttgaact 1320 

gatcattgaa aacttataaa cctaacagaa aagccacata atatttagtg tcattatgca 1380 

ataatcacat tgcctttgtg ttaatagtca aatacttacc tttggagaat acttaccttt 14 4 0 

ggaggaatgt ataaaatttc tcaggcagag tcctggatat aggaaaaagt aatttatgaa 1500 

gtaaacttca gttgcttaat caaactaatg atagtctaac aactgagcaa gatcctcatc 1560 

tgagagtgct taaaatggga tccccagaga ccattaacca atactggaac tggtatctag 1620 

ctactgatgt cttactttga gtttatttat gcttcagaat acagttgttt gccctgtgca 1680 

taatataccc atatttgtgt gtggatatgt gaagcttttc caaatagagc tctcagaaga 174 0 

attaagtttt tacttctaat tattttgcat tactttgagt taaatttgaa tagagtatta 1800 

aatataaagt tgtagattct tatgtgtttt tgtattagcc cagacatctg taatgttttt 1860 

gcactggtga cagacaaaat ctgttttaaa atcatatcca gcacaaaaac tatttctggc 1920 

tgaatagcac agaaaagtat tttaacctac ctgtagagat cctcgtcatg gaaaggtgcc 1980 

aaactgtttt gaatggaagg acaagtaaga gtgaggccac agttcccacc acacgagggc 2040 

ttttgtattg ttctactttt tcagcccttt actttctggc tgaagcatcc ccttggagtg 2100 

ccatgtataa gttgggctat tagagttcat ggaacataga acaaccatga atgagtggca 2160 

tgatccgtgc ttaatgatca agtgttactt atctaataat cctctagaaa gaaccctgtt 2220 

agatcttggt ttgtgataaa aatataaaga cagaagacat gaggaaaaac aaaaggtttg 2280 

aggaaatcag gcatatgact ttatacttaa catcagatct tttctataat atcctactac 234 0 

tttggttttc ctagctccat* accacacacc taaacctgta ttatgaatta catattacaa 2400 
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agtcataaat 
gaatttaggt 
aattggtttc 
acaaaccaag 
acttgtgagt 
gtcttagctg 
aaaagaaatt 
aaaaaaaaaa 



gtgccatatg 
gtgagatttt 
tttaaaatgc 
cctaagacac 
tgaatgacaa 
taaaaatgag 
tattttatac 
aaaaaaaang 



gatatacagt 
ttgtttccca 
tttggtggca 
atctgtgaat 
agcagttgaa 
aaagtgttgg 
gtgttatgtc 
tnhg 



acattctagt 
ggtatagcag 
cttttgtaaa 
acttagattt 
caaaaattat 
ttggttttaa 
tctaataaag 



tggaatcgtt 
gcttatgttt 
cagattgctt 
gtagcttaat 
ggcatttaag 
aatctggtaa 
tattcatttg 



<210> 20 
<211> 176 
<212> PRT 

<213> Homo sapiens 



tactctgcta 
ggtggcatta 
ctagattgtt 
cacattctag 
aatttaacat 
ctccatgatg 
ataaaaaaaa 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2844 



<400> 20 



Met 


Ser 


Ala 


Gly 


Gly 


Ala 


Ser 


Val 


Pro 


Pro 


Pro 


Pro 


Asn 


Pro Ala 


Val 


1 








5 










10 








15 




Ser 


Phe 


Pro 


Pro 


Pro 


Arg 


Val 


Thr 


Leu 


Pro 


Ala Gly 


Pro 


Asp He 


Leu 








20 










25 










30 




Arg 


Thr 


Tyr 


Ser 


Gly 


Ala 


Phe 


Val 


Cys 


Leu 


Glu 


He 


Leu 


Phe Gly Gly 






35 










40 










45 






Leu 


Val 


Trp 


He 


Leu 


Val 


Ala 


Ser 


Ser 


Asn 


Val 


Pro 


Leu 


Pro Leu 


Leu 




50 










55 










60 








Gin 


Gly 


Trp 


Val 


Met 


Phe 


Val 


Ser 


Val 


Thr 


Ala 


Phe 


Phe 


Phe Ser 


Leu 


65 










70 










75 








80 


Leu 


Phe 


Leu Gly 


Met 


Phe 


Leu 


Ser Gly 


Met 


Val 


Ala 


Gin 


He Asp Ala 










85 










90 








95 




Asn 


Trp 


Asn 


Phe 


Leu 


Asp 


Phe 


Ala 


Tyr 


His 


Phe 


Thr 


Val 


Phe Val 


Phe 








100 










105 










110 




Tyr 


Phe 


Gly Ala 


Phe 


Leu 


Leu 


Glu 


Ala 


Ala 


Ala 


Thr 


Ser 


Leu His 


Asp 






115 










120 










125 




Leu 


His 


Cys 


Asn 


Thr 


Thr 


He 


Thr 


Gly 


Gin 


Pro 


Leu 


Leu 


Ser Asp 


Asn 




130 










135 










140 








Gin 


Tyr 


Asn 


He 


Asn 


Val 


Ala 


Ala 


Ser 


He 


Phe 


Ala 


Phe 


Met Thr 


Thr 


145 










150 










155 








160 


Ala 


Cys 


Tyr 


Gly 


Cys 


Ser 


Leu 


Gly Leu 


Ala 


Leu Arg 


Arg 


Trp Arg 


Pro 










165 










170 








175 





<210> 21 

<211> 12642 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 7145, 7158, 7460, 7461, 7462, 7463, 7467, 7717, 7756, 7795, 
7799, 7803, 7818, 7822, 7833, 7842, 7843, 7852, 7860, 7864, 
7871, 7876, 8141, 11251, 11283, 11294, 11301, 11309, 11336, 
11341, 11345, 11352, 11357, 11363, 11373, 11380, 11391 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 11399, 11402, 11412, 11424, 11427, 11428, 11435, 11445, 11461, 
11472, 11478, 11488, 11490, 11497, 11519, 11527, 11548, 

11551, 12281, 12298, 12394, 12615, 12617, 12618, 12620, 
12621, 12624, 12627, 12628, 12629, 12633, 12634, 12635 
<223> n « A,T,C or G 



wo 03/000012 



18 



PCT/US02/19773 



<221> misc_feature 

<222> 12636, 12637, 12640 

<223> n = A,T,C or G 



<400> 21 

atggcgagcc 

cccggcgccc 

tgtggttata 

gagaaaccaa 

gggagagcct 

cactgcatcg 

aacgtctacg 

gtcgtcactg 

tatcaggtga 

gaggtccggg 

gtggaggtga 

cagcatgaca 

cgtgtggtca 

agcgtcagca 

gcggagctga 

ggggccacat 

atcctgaagg 

gactctccca 

ctcctcacac 

accaagtgtg 

cggcagctga 

aacctcaccg 

atccagacct 

cgactcttgc 

gaggaagacg 

aagatctaca 

atcaactaca 

aaagtgttca 

acctattcct 

cggattgcca 

aatgagacag 

gtcagtgttt 

gacattattg 

ctacactact 

gtgggtgaca 

tacagcatct 

cgtctggcta 

ctcctcacca 

gacaacaccg 

ctgggcgtga 

tcccaaagga 

gggcctgtgg 

ggcttctctt 

tcccagccca 

agctaccccc 

acgcagatga 

gggcagacag 

gggaacatca 

cactctgact 

gcgactcaag 

aactccgcca 

cgatactggc 

gagcccctgg 

atccgggagc 



tcgccgcgct 
gggctcagag 
gtgtggctct 
tgctggacca 
ctggccagaa 
acttccatta 
tgaaggtgaa 
agggctgggt 
tatttgaatc 
tccttgctca 
atgtggggca 
agctttggct 
accacaggcg 
agtaccgctg 
tcgtgaaaga 
acctgtggat 
aagtggaata 
actataagct 
gaccaggtga 
cagatccggt 
ccctgcagtg 
tgcagtacca 
cctcccacta 
tgtctaaccc 
ttccaggagc 
tccagtggaa 
aggctgtcgg 
agctccggaa 
tcaccatcaa 
ccaaaatttc 
acacgaccat 
atcagctggt 
agtgcttttc 
ttgctgctga 
ataagacata 
acttccaggc 
caaaagcacc 
caggtgcctc 
tgaagatggc 
tgctcaccat 
agctggccaa 
cctctgccga 
ctagttctca 
ccctcacgat 
gggaccagtt 
agagaggcca 
cttcgtggga 
tatcctacga 
acatcaatgc 
gtccgatgca 
gcatcgtcat 
cagatgacac 
cagaatacgt 
tccgcctctt 



cgccctcagc 
cgccccaggt 
agggaccaat 
ggcagtgccc 
ggcccacctt 
ctacttctcc 
tggtggcccc 
gaaggcagag 
cgtctcattg 
tccatgcaga 
gaatgccaca 
ccagcaatgg 
cttctcagcc 
tgtgatccgc 
gcctcccacg 
caagccaaat 
tcgcaccacc 
gtggcatctg 

ggggggtacg 

acatggccca 
ggagcccttc 
gtatgtgttc 
caccctgcga 
cgagggccga 
tgttcctcta 
acctcccaat 
ctcgctggac 
tgaaacccac 
ggccagcaca 
agctccatcc 
cacagtgatg 
tgtcaaggag 
ggtgcccgtg 
gttgaagcct 
caatggctac 
actcagcaaa 
aatgggcagc 
cacccagaat 
tggcgtgatc 
caaaaggaga 
gaagcagaag 
caaacccacc 
ggacgtcaac 
ccagactcat 
ccaactcgcc 
gggctacggg 
cacagccaag 
ccattcccgg 
caactacatt 
ggagactgta 
ggtcacaaac 
ggaggtctac 
catacgcacc 
ccacttcacc 



ctgctcctga 
ggctgttcct 
gggttcacct 
acaggatctt 
ctcctgccaa 
agccgtgaca 
caagggaacc 
ctcgccatca 
aagggtcatc 
aaagcacctc 
tttcagtgca 
aatggcaggg 
acagtcagtg 
tctgatggtg 
cccattgctc 
gccaactcca 
acaggcacgt 
gaccccgatg 
ggaccgccag 
cagaacgtgg 
ggctacgcgg 
aaccagcagc 
ggcctgcgcc 
atggagagcg 
gaatccatcc 
gagaccaatg 
ccaagtgctg 
cacctctttg 
gcaaagggct 
atgcctgagt 
ctgaaacccg 
gagcgacttc 
agctatcgga 
gccaacctgc 
tggaaccctc 
gccaatggag 
gcccaggtga 
tctaacactg 
gctggcctcc 
agaaatgctt 
gagacccaga 
accaagctca 
ggattcacag 
ccctaccgca 
atccgggtgg 
ttcaaggagg 
gaggatgaaa 
gtgaggctgc 
gacggatacc 
aaggactttt 
ctggtggaag 
ggagacatta 
ttcacagtcc 
agctggcctg 



ggctgcagct 
ttgatgagca 
gggagcagat 
tcatgatggt 
ccctgaagga 
ggtccagccc 
ctgtgtggaa 
gcactttctg 
ctggctacat 
attttctgcg 
ttgctggtgg 
acacggccct 
tggcagacac 
ggtctggtgt 
ccccagagct 
tcatcgggga 
gggcagagac 
ttgagtatga 
gggctcccct 
aaatcgtaga 
tgacccgctg 
agtacgaggc 
ccttcatgac 
aggagctggt 
aaggggggcc 
gggtcatcac 
acctctcgag 
tgggtctgta 
ttgggccccc 
acgacacaga 
ctcagtcccg 
agaagtcacg 
atgcctccag 
ctgtcaccca 
ctctctctcc 
agaccaaaat 
ccccggggac 
tggagccaga 
tcatgttcat 
attcctactc 
gtggagccca 
gcgccagccg 
atggcagccg 
cctgtgaccc 
ctgacttgct 
aatacgaggc 
accgcaataa 
tggtgctgga 
atcgacctcg 
ggagaatgat 
tgggcagggt 
aagtcaccct 
agaagaaagg 
accacggcgt 



gccgccactg 
ctacagcaac 
taacacaacg 
gaacagctct 
gaatgacacc 
aggggccttg 
tgtgtccggg 
gccacatttc 
cgccgtggac 
actccaaaac 
gaagtggtct 
gatggtcacc 
tgcccagcgg 
gtccaactac 
gctggctgtg 
tggccccatc 
ccacatagtc 
gatccgagtg 
caccaccagg 
catcagagcc 
ccatagctac 
cgaggaggtc 
catccggctg 
ggtgcagact 
ctttgaggag 
gctctacgag 
ccagaggggg 
cccagggacc 
tgtcaccact 
caccccattg 
gggagctcct 
gagggcagct 
cctcgattct 
gccatttaca 
cctgaaaagc 
caactgtgtt 
tccactctgc 
gaagcaggtg 
catcattctc 
ctattacttg 
gagggagatg 
caatgatgaa 
cggggagctt 
tgtggagatg 
gcagcacatc 
cttaccagag 
gaatcgatat 
tggagacccg 
gcactacatt 
ctggcaggag 
gaaatgtgtg 
gattgaaaca 
ctaccatgag 
tccctgctat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 
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gccactggcc ttctgggctt cgtccgccag gtcaagttcc tcaacccccc ggaagctggg 3300 

cccatagtgg tccactgcag tgctggggct gggcggactg gctgcttcat tgccattgac 3360 

accatgcttg acatggccga gaatgaaggg gtggtggaca tcttcaactg cgtgcgtgag 3420 

ctccgggccc aaagggtcaa cctggtacag acagaggagc aatatgtgtt tgtgcacgat 34 80 

gccatcctgg aagcgtgcct ctgtggcaac actgccatcc ctgtgtgtga gttccgttct 354 0 

ctctactaca atatcagcag gctggacccc cagacaaact ccagccaaat caaagatgaa 3600 

tttcagaccc tcaacattgt gacaccccgt gtgcggcccg aggactgcag cattgggctc 3660 

ctgccccgga accatgataa gaatcgaagt atggacgtgc tgcctctgga ccgctgcctg 3720 

cccttcctta tctcagtgga cggagaatcc agcaattaca tcaacgcagc actgatggat 3780 

agccacaagc agcctgccgc cttcgtggtc acccagcacc ctctacccaa caccgtggca 3840 

gacttctgga ggctggtgtt cgattacaac tgctcctctg tggtgatgct gaatgagatg 3900 

gacactgccc agttctgtat gcagtattgg cctgagaaga cctccgggtg. ctatgggccc 3960 

atccaggtgg agttcgtctc cgcagacatc gacgaggaca tcatccacag aatattccgc 4020 

atctgtaaca tggcccggcc acaggatggt tatcgtatag tccagcacct ccagtacatt 4080 

ggctggcctg cctaccggga cacgcccccc tccaagcgct ctctgctcaa agtggtccga 4140 

cgactggaga agtggcagga gcagtatgac gggagggagg gacgtactgt ggtccactgc 4200 

ctaaatgggg gaggccgtag tggaaccttc tgtgccatct gcagtgtgtg tgagatgatc 4260 

cagcagcaaa acatcattga cgtgttccac atcgtgaaaa cactgcgtaa caacaaatcc 4320 

aacatggtgg agaccctgga acagtataaa tttgtatacg aggtggcact ggaatattta 4380 

agctcctttt agctcaatgg gatggggaac ctgccggagt ccagaggctg ctgtgaccaa 4440 

gccccctttt gtgtgaatgg cagtaactgg gctcaggagc tctgaggtgg caccctgcct 4500 

gactccaagg agaagactgg tggccctgtg ttccacgggg ggctctgcac cttctgaggg 4560 

gtctcctgtt gccgtgggag atgctgctcc aaaaggccca ggcttccttt tcaacctaac 4 620 

cagccacagc caagggccca agcagaagta cacccacaag caaggccttg gatttctggc 4680 

tcccagacca cctgcttttg ttctgagttt gtggatctct tggcaagcca actgtgcagg 4740 

tgctggggag tgggaggctc ccctgccctc cttctcctta ggagtggagg agatgtgtgt 4800 

tctgctcctc tacgtcatgg aaaagattga ggctcttggg ggtcactgct ctgctgcccc 4860 

ctgcaacctc cttcaggggc ctctggcacc agacatttgc agtctggacc agtgtgacct 4920 

tacgatgttc cctaggccac aagagaggcc ccccatcctc acacctaacc tgcatggggc 4 980 

ttcgcccaca accattctgt accccttccc cagcctgggc cttgaccgtc cagcattcac 5040 

tggccggcca gctgtgtcca cagcagtttt tgataaaggt gttctttgct tttttgtgtg 5100 

gtcagtggga gggggtggaa ctgcagggaa cttctctgct cctccttgtc tttgtaaaaa 5160 

gggaccacct ccctggggca gggcttgggc tgacctgtag gatgtaaccc ctgtgtttct 5220 

ttggtggtag ctttctttgg aagagacaaa caagataaga tttgattatt ttccaaagtg 5280 

tatgtgaaaa gaaactttct tttggagggt gtaaaatctt agtctcttat gtcaaaaaga 5340 

agggggcggg ggagtttgag tatgtacctc taagacaaat ctctcgggcc ttttattttt 5400 

tcctggcaat gtccttaaaa gctcccaccc tgggacagca tgccactgag caaggagaga 54 60 

tgggtgagcc tgaagatggt ccctttggtt tctggggcaa atagagcacc agctttgtgc 5520 

ataatttgga tgtccaaatt tgaactcctt cctaaagaaa cccagcagcc accttgaaaa 5580 

aggccattgt ggagcccatt atactttgat ttaaaatagg ccaagagaat caggcctgga 5640 

gatctagggt cttgtccaaa gtgtgagtga gtcaatgaga gggaaccaac atttgctaag 5700 

tctctactgt atgccaggga tcatgcttgg cactttccat aggacatttc acacagtcct 5760 

tagaaccccc aggagagagc tactgacttg ttatcatctc catttgatca tctcctccaa 5820 

tgaggaaacc cacgcacctt ccttagtaat gaaatcctgg gttccaaagg ggcaggtaat 5880 

ggcaatgaga cttctccgtg ctgttttctt catcttctct aagccaagca attattttat 5940 

ggagggaaaa taaggccaga aacttctgag cagataactc cacaaatgga aatttagtac 6000 

tttcttcctg atgccagttc ttctgggaag cgcagaattt cagatatatt ttagtaacac 6060 

attcccagct ccccaggaaa gccagtctca tctaatttct tagtcagtaa aaacaattcc 6120 

ctgttccttc aggctatgaa tggaccagcc agggaaactc tcgaccttga tctctagcca 6180 

gtgcttaggc ccaatatctg acagcctcag gtgggctggg acctaggaag ctccatcttg 6240 

aaggctggtc tagccccaga cagggcatga ggggcagaga attcaagaag gtacagcttt 6300 

ggccctcaag agcccactgt atgctgggga aatggaacca tggtgcagta gtgtggagtg 6360 

gatgagtgtt ccatgagcct aggagcaaga aagtctcttc ggcctcgggc ttcctggaga 6420 

aggggacgtc cattcctgct gggtcttaac aagcataaaa aggaaaaaaa ggaaactcag 6480 

gcaaagggat ccatatgtgc aatggcaaag aaatgtgaaa aggcattggg agaagcagtc 6540 

tgggggaggc cagcccagtg cgggcacagc acaacacggg gagcagcaag agatgagcca 6600 

gggtccagga gacagatgcc catcgcgagt acagactttg tcctattggc aacaaggagt 6660 

ccatggagct ttagagagat gcactcagct tcgtgttggc caagactcct tctgggccaa 6720 

tggggctgcc tcttttcctt tcatcagaca ctgtgaaaac attcccttaa gcgtgcactt 6780 
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tttaatatca catctatttg tctgtctgct 
atggatcatg agactcagat tctatgagaa 
ggtcaccaac ccagatctcc aggcccatga 
tgcttccatt ggcatgggct ctggagctgt 
aggaagggct gtcactttag aggttcaaaa 
gaaccccacg atgaacttgc tcttttcctt 
agacncccgg ggacccancc cccactgggc 
ccccagctgg ccttccccag cctgcaagga 
tgctattttc ttagatcttg gtacatccag 
acacaaatcc taagattttt ttctgctcag 
gacagtggct tttatcattt taaatggtag 
aacttgcaaa aaaaaaaaan nnntctncag 
tgatgatttc ccccagctca tccccttatt 
gacagtgaat aggtcttagg ccagtgcaca 
tgacttccct aagatcccat gggacaagga 
agtgcccagg aatttgaaca ttttccttta 
acccccaggg ggtctncccc accccacccc 
agncagagga aaata'ttnat tnaatgctct 
accntcatag natgtngggc tctgtttggg 
gaaatttctc cacctgtgtt tatggcctaa 
caacattgca tcctttctgg ggtgaagaat 
gaaggccaca gtaggcctct ctttgtggga 
tggccctctc tgtgggtgtg gcctgccaaa 
cctaacatgt gagcctccat taaattcctt 
gtcaccattt actgatgagg aaactaaggc 
cccacagcta gtgagtgaat gaaccaggat 
atatttttcc accattgtat ctcacatttt 
tagagtggaa cagatacgtc agtggataaa 
tccttcatag tctcatgttt tacgttcaca 
atttagcctt gttggtttta atatgactct 
tcctcatgtt gttctgttct gttcaccccg 
agcttgaccc tagaattgac tctaggagca 
aggctcaaga tcaagaccaa ctgaccttct 
ctgacctcac tgttcatgag gggacctcaa 
gaacccatta actcacacta gaatgatgag 
gcccacaaaa ccctaaaggg aaaagagaag 
aatgctaggc tggaaagtgg tgcctgttgt 
gcaagtctga ggacacactt catgagcagc 
agccacacat aattactggg tgcctacagc 
aaatcgagat tactatggca caattaattc 
agctcagtct ttagtttgaa cgaaaacata 
ccctcatcaa cattagaagg ggtagatttg 
aggccaggaa actggaaccc agtgaatact 
caaaatggtc acttctgtga catagctgga 
cttggaacaa gggcaaatct cagtgtctgg 
atcatgttta ttcatcctcc actccccatt 
caggatctac atttcctcgt cgctggtggg 
cagtaagact gctctacaca acagaagtga 
ctgttacctc tgcaggagtg actgcccaac 
attccttggt acctcactgc cttgggacaa 
agcttgccac tcctgggctg ggtaaacagg 
attctggaag acatgggtga gcccaagagg 
tgtcaacagg ttgtgctggg ccaggatccc 
ggtaggttag aagaaaaaaa acataaacgt 
tggcttcccc attacttgac agtctgagct 
cagccaagta aaacaaggaa ataggaaaag 
tcagatttgt ttactcctcc catgcaaagt 
gtctagaagt gatagaatgc aaggccttgg 
catagactga acagcaggga gagaggttag 



cattgttttg ttgctggaac taaatatgca 6840 
acccagggtc tctgctttac cacggagcag 6900 
ggatggaaca tgaaaggagc cgacaaaagt 6960 
ccagaagtcc agggacacca gacttgatca 7020 
ggaagtgcct caaagcaaag gcaagcaaag 7080 
tgatgagcct ctccccaggt gtatttcagc 7140 
ctgctggcct ccctcggctc cagcccaatg 7200 
gcctgtagca tggcaaatct gcctgctgta 7260 
acaggatgag ggtggaggga gagctattta 7320 
gaaggggtga aatagctggc agatacaaaa 7380 
gaatttaagg tgtgacttca gggagaaaca 7440 
gccatgttgg ggtaacccag caagggccag 7500 
ttcccacaac ccaaccattc tctaaagcag 7560 
caggaagaaa ttgaggctta tggatgggga 7620 
tgtggcaagg cttggatgag atggggcacc 7680 
cccaggnaaa tctccggagc caacaccacc 7740 
atttacaggg tgagctcagc ctgtncatng 7800 
ctngagtctt tnnacaacag gnagctcttn 7860 
gaagatgcaa ggaagtaatg agaagcccag 7920 
atagcttcag gatgtatctt agctgcactc 7980 
ctgggccaac caggggtcct tgggcctcta 8040 
atggaaaggg gacagtttgc tttttagtgc 8100. 
ggaaccaaca ngaccctatg cctggggact 8160 
cccagcattc ctaaaggagg gtttgtgatt 8220 
tcctagggga gaaatcactt gcccacagtt 8280 
ttaaaccggt tttttctcac tacagagaca 8340 
tcccaggagg ttacccataa cagaagagac 8400 
gctcaaagca ,aacaacagta agcttaaaat 8460 
attcatgcaa aatttgcatt ccactttctg 8520 
atgaatattt caaaaaaaaa tgtgctctgt 8580 
ctatgacgga ccctaggtca gctggtcttc 8640 
gtgaccctgc tgcctcccag agccagttat 8700 
cctaggcagc tcctttggtg tgtgggtgct 8760 
ctaaggcatc ttccagttgg gtgctggaag 8820 
gatttgctca tctggcgtgg agaaggatga 8880 
ctggacacag ctgtactcag cagattcctg 8940 
ccaagtggag tcacatggtt gctaatgtgg 9000 
tggggtctgg aaggctcctc actttaccct 9060 
acctagcacc ttggaggggg cactattagg 9120 
ctgggtaagg catggggttg tggtggacag 9180 
catacatgaa aaacatacat gaaaaaagga 9240 
gagcacttta ggcaggaaaa caggaacgca 9300 
cagaaccgag gatgcagatg acttatttag 9360 
gaaaggatgg gtaacagctt gccagagcca 94 20 
ggcaaaagat gatgcatttc cctctgaccc 9480 
gccacactag ctcttgctgt aagtcctcac 9540 
aaccccttag agtacataga ggtatcagtc 9600 
ggcccaggga gtagcagcca ggcccttatc 9660 
ccagatccag agacattgaa ggaaatgata 9720 
aatgaagaaa gccacccttc cttaggctgc 9780 
tcatcagcac caggctcaac caggagtaac 9840 
aagcatgaac aggacgctgt tcctaagtca 9900 
cagggaaaaa aatggtcaac ccaactggag 9960 
ggatagtcat gtcatctcaa atccctgact 10020 
ccttcttagc ctgtgaccag cttcaaatca 10080 
taaatccaac tagaagagac aagctgagat 10140 
ttccctgttg gaggttttcc atgtatacat 10200 
ctttgtcttg cagggatctg cctttgaggt 10260 
tggtggagtg tggggggagc tgttctagct 10320 



wo 03/000012 PCT/US02/19773 

21 

ccagtttctt ctgacacatt tttcaggatc atggatctga tcctccgaag cacagcagag 10380 
atatctaagc catatttgtg cacatgagca gactcttcta gttttttagt aaccagggat 10440 
gggcttttgc atggcactga ctatagagat gtcttgtaga gatcaagcca gtcttttgca 10500 
tcccacctgc ccacctccag aagagatggg aaaaggtcat caaagggcat tcaccaactg 10560 
aaatccactc atgaatgtta ggtctctaaa aggaggcatc aacactcaca atggtagcct 10620 
ccaaacctag catcccacct atctaagagc tcaggggtgg tccactgggg cagatacaag 10680 
ggaagtgcaa gggctcagga tgaaagaaaa tctattggga agagttttag gggcttgatc 10740 
attatggggc ttccttctat atctgagaac tgctctgggt ggtgagatgt ggactctgat 10800 
ccttaattgg aatgttcgga gaatgagtgt ctggtggcct tgaagtgttg gacagaaaag 10860 
tatcagtata aaagcctgga gctcagggta attaatgtag ttcatggttc cttagtgagc 10920 
aggactcttg gatgtggagg agaaagggtc ataggaagta aaccaccaaa attacaaaat 10980 
tgagtctctg tacaattact tcagtgcctt tgggcttatg aatacaaatc agtgggcctt 11040 
ctctatgatg gtccaacaaa ctctcagtgt ccaccctgtc cctgtatctc ccatggaaga 11100 
tgaataatgt caggtgttct ttgggtcaaa ggccccaggg cagtctggag gcttagaggg 11160 
cagagtggtg tcattccatg taaagttagg cttctgaggg gtcaggcaga atatggtgtc 11220 
catatcttcc atagctctgc agattcttgg natgaagtca agcacagttt gctagaccca 11280 
ggntcactcc tctngagtat naactaggna cccatgagtg aaacttaata gctgtnaagg 11340 
naagnaacct gnctgtnctg ccnagagagg atnaagctgn cccatctcag ncagctgtnc 11400 
tnaaaagaag gncaggtgtc tctnttnnaa agggnaagag gaagncattg gtggaaatgg 114 60 
nattttcagg tncacttncc attcccangn atgggtngag atcttgtgga gctgggatnc 11520 
atgtttngaa ctcattcata cctgtagnag ncacgaattc caagtagatt gtgtttggtc 11580 
tgtacaggct gaagccccct gctctcccac ccaagtgccc ccactgagca ggccaacatg 11640 
ctgttgtggc cacatatact gggctgatcc aggctggtta tcaccaaaca gcaaaccata 11700 
gggaacagct gctttgccat agacccaata cccatgtaga tctctcatga gagcagccat 11760 
aactcagacc cactgaccaa cagggccatg agtgacagcc agaaccagtg aaggtccaag 11820 
taggacacag agcagggctt ttcttaccat acacattatc tccagaggtt atttctaccc 11880 
cactccctat tcaaggcctg ttggagcaca ctgcaaaagc aaaagcacag taactcaatt 11940 
tacacatgat tataatcatt tccagtgcac acatttcatc accaggtgga tcctgagcta 12000 
gcccatgtaa atccgggtta acccatattg gtaatcatac tcaaaagcac ttttcaccct 12060 
acattctact agccaatcaa agacaaagag ttgtggcctc taccattgcc ttggcttctg 12120 
gacaccctca caagctatcc caaggttccc ggctcaaccc ccagggrggc cggacatcct 12180 
tcacatccca ckgggccata aaaatattgc catgagaccc aaagtcctcc cacactcttt 12240 
gcagccctcc tcccatgaat ccccaatggc ctgcacttgt nacagtttgg gtgtttgnat 12300 
agataaagca cgtatgagaa gagaaaacaa aataaatcaa ctttttaaaa aagccagcac 12360 
tgtgctgtca atgttttttt tttcttttca attnctagct cagaaaagca gaaggtaaat 12420 
aatgtcaggt caatgaatat cagatatatt ttttgactgt . acattacagt gaagtgtaat 12480 
ctttttacac ctgcaagtcc atcttattta ttcttgtaaa tgttccctga caatgtttgt 12540 
aatatggctg tgttaaaaaa tctatacaat aaagctgtga ccctgagaww matgttttcc 12600 
taagataaaa aaaangnnan nstnyknnnc tknnnnngtn hg 12642 

<210> 22 
<211> 1463 
<212> PRT 

<213> Homo sapiens 
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90 








95 


Glu Asn Asp Thr 


His 


Cys 


He 


Asp 


Phe 


His 


Tyr 
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100 
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Pro 


vax 


Ala 


Ser 


Ala 


Asp 


Lys 


Pro 


Thr 


Thr 


Lys 








Q o n 
oZU 










o o c 
OZO 










830 








Ser* 


Ala 


Ser 


Arg 


Asn 


Asp 


blU 


biy 


Phe 


Ser 


Ser 


Ser 


Ser 


Gin 


Asp 






Q T R 










o ^ n 










845 








V aX 


Asn 


oxy 


trne 


1 nr 


Asp 


ijxy 


Ser 


Arg 


Gly Glu 


Leu 


Ser 


Gin 


Pro 


Thr 




p R n 










<3 R R 










860 










Leu 


ixir 


lie 


oxn 


1 nr 


His 


Pro 


Tyr 


Arg 


Thr 


Cys 


Asp 


Pro 


1 7 -« 1 

Val 


Glu 


Met 


ODD 










Q T n 
o /U 










875 










880 


C d V 


Tyr 


Pro 


Arg 


Asp 


oin 


rne 


bin 


Leu 


Ala 


He 


Arg 


Val 


TV T -> 

Ala 


Asp. 


Leu 










Q 0 R 










890 










one: 

895 




Lsu 




nxs 


T 1 o 

X xe 


i nr 


vjXn 


Met 


Lys 


Arg 


Gly Gin 


Gly Tyr 


Gly 


Pne 


Lys 








on A 










Q AR 

yuo 










A 1 A 

910 








ox u 


T VT* 

lyr 


V3X U 




X>cU 


rx O 


mil 


vjxy 


Gin 


Thr 


Ala 


Ser 


Trp 


Asp 


inr 






Q1 R 
yiD 




















925 








MXO 




oxu 


Asp 


m 11 
uXU 


Asn 


Arg 


Asn 


Lys 


Asn Arg 


Tyr Gly 


Asn 


lie 


T 1 ^ 

lie 














Q "5 R 

y Jo 










940 












iyr 


Asp 


nxs 


oer 


Arg 


vax 


Arg 


Leu 


Leu 


Val 


Leu 


Asp 


Gly 


Asp 


Pro 












you 










955 










ft ^ A 

960 


His 


0\5X 




iyr 


X xe 


Asn 


riXa 


Asn 


Tyr 


He Asp 


Gly Tyr 


His 


Arg 


Pro 










OCR 










970 










975 






His 


Tyr 


xxe 


Aia 


xnr 


Gin 


Gly 


Pro 


Met 


Gin 


Glu 


Thr 


Val 


Lys 


Asp 








you 










985 










990 






Phe 


Trp 


Arg 


Met 


lie 


Trp 


Gin 


Glu 


Asn 


Ser 


Ala 


Ser 


He 


Val 


Mat* 


VaX 






995 










1000 








1005 






Thr 


Asn 


Leu 


Val 


Glu 


Val 


Gly 


Arg Val 


Lys 


Cys 


Val 


Arg 


Tyr 


Trp 


Pro 




1010 








1015 








1020 








Asp 


Asp 


Thr 


Glu 


Val 


Tyr 


Gly 


Asp 


He 


Lys 


Val 


Thr 


Leu 


He 


Glu 


Thr 


1025 








1030 








1035 








1040 


Glu 


Pro 


Leu 


Ala 


Glu 


Tyr 


Val 


He 


Arg 


Thr 


Phe 


Thr 


Val 


Gin 


Lys 


Lys 
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1045 1050 1055 

Gly Tyr His Glu lie Arg Glu Leu Arg Leu Phe His Phe Thr Ser Trp 

1060 1065 1070 

Pro Asp His Gly Val Pro Cys Tyr Ala Thr Gly Leu Leu Gly Phe Val 

1075 1080 1085 

Arg Gin Val Lys Phe Leu Asn Pro Pro Glu Ala Gly Pro lie Val Val 

1090 1095 . 1100 

His Cys Ser Ala Gly Ala Gly Arg Thr Gly Cys Phe lie Ala lie Asp 
1105 1110 . 1115 1120 

Thr Met Leu Asp Met Ala Glu Asn Glu Gly Val Val Asp He Phe Asn 

1125 1130 1135 

Cys Val Arg Glu Leu Arg Ala Gin Arg Val Asn Leu Val Gin Thr Glu 

1140 1145 1150 

Glu Gin Tyr Val Phe Val His Asp Ala He Leu Glu Ala Cys Leu Cys 

1155 1160 1165 

Gly Asn Thr Ala He Pro Val Cys Glu Phe Arg Ser Leu Tyr Tyr Asn 

1170 1175 1180 

He Ser Arg Leu Asp Pro Gin Thr Asn Ser Ser Gin He Lys Asp Glu 
1185 1190 1195 1200 

Phe Gin Thr Leu Asn He Val Thr Pro Arg Val Arg Pro Glu Asp Cys 

1205 1210 1215 

Ser He Gly Leu Leu Pro Arg Asn His Asp Lys Asn Arg Ser Met Asp 

1220 1225 1230 

Val Leu Pro Leu Asp Arg Cys Leu Pro Phe Leu He Ser Val Asp Gly 

1235 1240 1245 

Glu Ser Ser Asn Tyr He Asn Ala Ala Leu Met Asp Ser His Lys Gin 

1250 1255 1260 

Pro Ala Ala Phe Val Val Thr Gin His Pro Leu Pro Asn Thr Val Ala 
1265 1270 1275 1280 

Asp Phe Trp Arg Leu Val Phe Asp Tyr Asn Cys Ser Ser Val Val Met 

1285 1290 1295 

Leu Asn Glu Met Asp Thr Ala Gin Phe Cys Met Gin Tyr Trp Pro Glu 

1300 1305 1310 

Lys Thr Ser Gly Cys Tyr Gly Pro He Gin Val Glu Phe Val Ser Ala 

1315 1320 1325 

Asp He Asp Glu Asp He He His Arg He Phe Arg He Cys Asn Met 

1330 1335 1340 

Ala Arg Pro Gin Asp Gly Tyr Arg He Val Gin His Leu Gin Tyr He 
1345 1350 1355 1360 

Gly Trp Pro Ala Tyr Arg Asp Thr Pro Pro Ser Lys Arg Ser Leu Leu 

1365 1370 1375 

Lys Val Val Arg Arg Leu Glu Lys Trp Gin Glu Gin Tyr Asp Gly Arg 

1380 1385 1390 

Glu Gly Arg Thr Val Val His Cys Leu Asn Gly Gly Gly Arg Ser Gly 

1395 1400 1405 

Thr Phe Cys Ala He Cys Ser Val Cys Glu Met He Gin Gin Gin Asn 

1410 1415 1420 

He He Asp Val Phe His He Val Lys Thr Leu Arg Asn Asn Lys Ser 
1425 ^ 1430 1435 1440 

Asn Met Val Glu Thr Leu Glu Gin Tyr Lys Phe Val Tyr Glu Val Ala 

1445 1450 1455 

Leu Glu Tyr Leu Ser Ser Phe 
1460 



<210> 23 

<211> 1297 

<212> DNA 

<213> Homo sapiens 
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<400> 23 

gtcgacccac 

cccactccac 

cccgggccag 

ctttcctgtc 

gcacagcctc 

catcgtcatg 

ggagactcca 

aaattgggtg 

ccaggatgga 

ctctcgcaag 

caagcagcct 

gttggcgtgc 

ccacaggcct 

ccatgggaac 

tctaaccaga 

gacataggat 

tgaccctctc 

gtccctgcaa 

tagtgtgaga 

catgcgaaat 

tatctcctcc 

tgtcccctca 



gcgtccgtgc 
ggtgcagctg 
cgctctgaca 
ctggacgcca 
caggttggcg 
agtgcaaaat 
cctctcatca 
gaggaccgtt 
attcttcctc 
agggtctctt 
ggagacttcc 
cctggaggct 
gttcccttgt 
ctcatatgca 
tggaacactg 
cgtcccgcta 
acttaccccg 
gcccgttaaa 
acttccaaca 
gttctcatga 
aatgtcctga 
aaaaaaaaaa 



tcagcctggt 

cggcttatct 

tgcagaaggt 

atgacctaga 

ggctcatctg 

gcaaatgcaa 

ccccaggctc- 

ctctgtcccc 

ctctgctggg 

tgttcaattt 

tatgtgtgca 

gacacagagg 

ggcttgggac 

atttgggatt 

gaacattcca 

tgatggaagt 

agacctcact 

taattcccta 

agcctcaaag 

actaccccac 

aacatgaacc 

aaaaaaaggg 



gaaccacaca 
ctcagcccag 
gaccctgggc 
agataaaaac 
cgctggggtt 
gtttggccag 
agcccaaagc 
aggtcctgtc 
actcctttgc 
tttttaatct 
ttggggtggg 
ctggcactga 
atggcacagg 
tactagtagc 
gtggaccctg 
gttcagacag 
ttattacaag 
tgctaccctt 
tcccttgaga 
aacacgccta 
caaaaagaga 
cggccgc 



ggcccgagtt 
cgagatgcca 
ctgcttgtgt 
agtcctttct 
ctgtgcgcca 
aagtccggtc 
tgatgaggac 
tctgcacaga 
atggcagggc 
aaaatgattg 
gcttggggca 
gcctgcttgt 
cccgccctct 
caaaaggaat 
gaccattcca 
tttataatag 
atctttccaa 
aataacatac 
ctccccaata 
aaactcaaaa 
cccacaataa 



tcacccagtc 
gccttcctgt 
tcctggcagg 
actatgactg 
tgggcatcat 
accatccagg 
agaccagctg 
aacttgaact 
ctcatctcac 
tgcctctgcc 
ccatgagaag 
tgggaaaagc 
gcctcctcag 
gaaagagagc 
ggaaaactgg 
taagcccctg 
atacccaaat 
aatgaccaca 
cctaataagg 
cacccaaaaa 
actcgtgact 



<210> 24 
<211> 87 
<212> PRT 

<213> Homo sapiens 

<400> 24 

Met Gin Lys Val Thr Leu Gly Leu Leu Val Phe Leu Ala Gly Phe Pro 

15 10 15 

Val Leu Asp Ala Asn Asp Leu Giu Asp Lys Asn . Ser Pro Phe Tyr Tyr 

20 25 30 

Asp Trp His Ser Leu Gin Val Gly Gly Leu lie Cys Ala Gly Val Leu 

35 40 45 

Cys Ala Met Gly lie lie lie Val Met Ser Ala Lys Cys Lys Cys Lys 

50 55 60 

Phe Gly Gin Lys Ser Gly His His Pro Gly Glu Thr Pro Pro Leu lie 
65 70 75 80 

Thr Pro Gly Ser Ala Gin Ser 
85 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1297 



<210> 25 

<211> 1888 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 1814, 1834, 1850 

<223> n = A,T,C or G 



<400> 25 

aggtacgcgg gggaataatg tgtggcttct 
ggcaatgctt ccccgaggtg tgcacctttg 
ggcgctactc gcgggacgcc gtcacgtgat 



gttggattgc ttttctttcc aaaattccta 60 
tgaggtgttt gtggggttgg gggagcttca 120 
ccgggacgag gtggagttcg gctttaagga 180 
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ggcgtctctt 
ttccccactc 
gacaaattcc 
aaatggattc 
atactgtcca 
ttattttccc 
cttctggctt 
cttgcttatg 
atcttgaaac 
tcagacttgc 
cactgggtct 
actgtgtcca 
cctcgagggt 
cttgctgtgg 
caaaatgtaa 
gactctcata 
tgttttatat 
atggacaaga 
ctgctctcca 
gtgtttgatt 
tgccacctca 
aaaaacttcc 
gatgattatg 
aagcaagcaa 
tgaaactact 
aacatacttt 
agattttctt 
aacaaaaggg 
tttgaagtgc 



cctagcttca 
tgctcccttc 
aaagaggatt 
caatatgggg 
tttgtctaaa 
aggttgaaaa 
ttgaaaatga 
gagctgtgat 
aaaaggagat 
ttgttgccat 
ttggtgaggc 
ttttctctct 
ggagaccaaa 
cttcttcttt 
cacttgatgc 
ggttgtctta 
ttatttgcta 
tgagagacaa 
ttgtggtagc 
ggaatcatca 
cagcaatgat 
agagagactt 
aaacaatagc 
gcccagtcgc 
tatagcctat 
gattacctgt 
gtcttggctt 
tgtngggctt 
tttttgtgaa 



tcaatcttta 
cattcccacc 
gttcagttca 
aataagaata 
ataatctata 
tcattcagtc 
tgattgtcat 
cattcttggt 
gagaaatgtt 
catgtgtctc 
gatgtgtaag 
ggttctcatt 
taatagacat 
gcctttcctg 
gtacaaagac 
taccactctc 
cttcaagata 
taagtacagg 
atttgcagtc 
gatcattgct 
atccacttgt 
gcagttcttc 
catgtccacg 
atttaaaaaa 
ggtcccggat 
tctcccaagg 
tttactgctt 
tkgggatctt 
tttaccag 



ggatctgagc 
cttccttctt 
agggaatgaa 
agctgaacag 
acaaccaaac 
cactctaatt 
ctgcccttgg 
gtctctggaa 
accaacatcc 
ccctttacat 
ttgaatcctt 
gctgtggaac 
gcttatgtag 
atctaccaag 
aaatacgtgt 
ctcttggtgc 
tatatacgcc 
tccagtgaaa 
tgctggctcc 
acctgcaacc 
gtcaacccca 
ttcaactttt 
atgcacacag 
atcaacaaca 
gacatctgtt 
amtggggttg 
ttgttgtagt 
tctnggrrat 



aggagaaata 
taataagcag 
gaattcagaa 
ttgacctgct 
caatcaaaat 
tctcagagaa 
ccatgatatt 
acctggcctt 
tgattgtgaa 
ttgtctacac 
ttgtgcaatg 
gacatcagct 
gtattgctgt 
taatgactga 
gctttgatca 
tgcagtattt 
taaaaaggag 
ccaaaagaat 
ctcttaccat 
acaatctgtt 
tattttatgg 
gtgatttccg 
atgtttccaa 
atgatgataa 
taaaaacaag 
aaatcatttg 
tgtcataatt 
tagkkgttgn 



ccagcggatc 
gagcgaaaaa 
taattttggt 
ttgaagaaac 
gaattcaaca 
gaatgcccag 
taccttagct 
gatcataatc 
cctttccttc 
attaatggac 
tgtttcaatc 
gataatcaac 
gatttgggtc 
tgagccgttc 
atttccatcg 
tggtccactt 
aaacaacatg 
caatatcatg 
ctttaacact 
attcctgctc 
gttcctgaac 
gtctcgggat 
aacttctttg 
tgaaaaaatc 
cacaacctgc 
aaaatgacta 
tacatttggg 
accmgacatc 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1888 



<210> 26 
<211> 384 
<212> PRT 

<213> Homo sapiens 



<400> 26 



Met 


Asn 


Ser 


Thr 


Leu 


Phe 


Ser 


Gin 


Val 


Glu 


Asn 


His 


Ser 


Val 


His 


Ser 


1 








5 










10 










15 




Asn 


Phe 


Ser 


Glu 


Lys 


Asn 


Ala 


Gin 


Leu 


Leu 


Ala 


Phe 


Glu 


Asn 


Asp 


Asp 








20 










25 










30 






Cys 


His 


Leu 


Pro 


Leu 


Ala 


Met 


He 


Phe 


Thr 


Leu 


Ala 


Leu 


Ala 


Tyr 


Gly 






35 










40 










45 








Ala 


Val 


He 


He 


Leu 


Gly 


Val 


Ser 


Gly 


Asn 


Leu 


Ala 


Leu 


He 


He 


He 




50 










55 










60 










He 


Leu 


Lys 


Gin 


Lys 


Glu 


Met 


Arg 


Asn 


Val 


Thr 


Asn 


He 


Leu 


He. 


Val 


65 










70 










75 










80 


Asn 


Leu 


Ser 


Phe 


Ser 


Asp 


Leu 


Leu 


Val 


Ala 


He 


Met 


Cys 


Leu 


Pro 


Phe 










85 










90 










95 




Thr 


Phe 


Val 


Tyr 


Thr 


Leu 


Met 


Asp 


His 


Trp 


Val 


Phe 


Gly 


Glu 


Ala 


Met 








100 










105 










110 






Cys 


Lys 


Leu 


Asn 


Pro 


Phe 


Val 


Gin 


Cys 


Val 


Ser 


He 


Thr 


Val 


Ser 


He 






115 










120 










125 








Phe 


Ser 


Leu 


Val 


Leu 


He 


Ala 


Val 


Glu 


Arg 


His 


Gin 


Leu 


He 


He 


Asn 




130 










135 










140 










Pro Arg 


Gly 


Trp 


Arg 


Pro 


Asn 


Asn 


Arg 


His 


Ala 


Tyr 


Val 


Gly 


He 


Ala 


145 










150 










155 










160 


Val 


He 


Trp 


Val 


Leu 


Ala 


Val 


Ala 


Ser 


Ser 


Leu 


Pro 


Phe 


Leu 


He 


Tyr 










165 










170 










175 




Gin 


Val 


Met 


Thr 


Asp 


Glu 


Pro 


Phe 


Gin 


Asn 


Val 


Thr 


Leu 


Asp 


Ala 


Tyr 
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180 










Lys 


Asp 


Lys 


Tvr 


V Cl i 


^y o 


Phe 


Asp 






J. ^ •J 












Leu 


Ser 


lyi 


Thr 


Thr 


Leu 


T .oil 






210 














Cys 


Phe 


Tie* 
i. ^ c 


Php 


Tip 
X xc 




lyr 




225 
















Arg Asn 


Hen 
ASM 


M^t- 


Mot* 


rtsp 


T \/c 




















Glu 


Thr 


Lys 


7\ yn 


T 1 P» 




Tl P 










^ O v' 










Ala 


Val 


v-*y o 


irp 


Lteu 




i^eu 


















280 


Asn 


His 


ox 1 1 


X X 7 


lie 


Ala 


Thr 


Cys 




290 










295 




Cys 


His 


Leu 


Th-r 


AT A 


Met 


He 


Ser 


305 










310 






Gly Phe 


Leu 


& c rt 




Asn 


Phe 


Gin 










'5 0 








Phe Cys 


Asp 


Phe 


Arg 


Ser Arg Asp 








340 










Ser 


Thr 


Met 


His 


Thr 


Asp 


Val 


Ser 






355 










360 


Pro 


Vai 


Ala 


Phe 


Lys 


Lys 


He 


Asn 




370 










375 





1 Q c; 
1 o D 










190 






Gin 


Phe 


Pro 


Ser 


Asp 
205 


Ser 


nX S 


Arg 


Val 


Leu 


Gin 


Tyr 
220 


Phe 


Gly 


Pro 


Leu 


Lys 


He 


Tyr 


He Arg Leu 


Lys 


Arg 






235 










O A f\ 

Z 4 u 


Arg Asp Asn 


Lys 


Tyr Arg 


Ser 


Ser 




250 










O c; 




Leu 


Leu 




He 


V d X 


Va 1 


Ala 


Phe 


265 










270 






He 


Phe 


Asn 


Th r 


Val 
285 


Phe 


Asp 


Trp 


Asn 


His 


Asn 


Leu 
300 


Leu 


Phe 


Leu 


Leu 


Thr 


Cys 


Val 
315 


Asn 


Pro 


He 


Phe 


Tyr 


Arg Asp 


Leu 


Gin 


Phe 


Phe 


Ph^ 


noil 




330 










335 




Asp Asp 


Tyr 


Glu 


Thr 


He 


Ala 


Met 


345 










350 






Lys 


Thr 


Ser 


Leu 


Lys 
365 


Gin 


Ala 


Ser 


Asn 


Asn 


Asp 


Asp 


Asn 


Glu 


Lys 


He 



380 



<210> 27 
<211> 852 
<212> DNA 

<213> Homo sapiens 

<400> 27 

ctagtcctga cttcacttct gatgaggaag cctctctcct tagccttcag cctttcctcc 60 

caccctgcca taagtaattt gatcctcaag aagttaaacc acacctcatt ggtccctggc 120 

taattcacca atttacaaac agcaggaaat agaaacttaa gagaaataca cacttctgag 180 

aaactgaaac gacaggggaa aggaggtctc actgagcacc gtcccagcat ccggacacca 240 

cagcggccct tcgctccacg cagaaaacca cacttctcaa accttcactc aacacttcct 300 

tccccaaagc cagaagatgc acaaggagga acatgaggtg gctgtgctgg gggcaccccc 360 

cagcaccatc cttccaaggt ccaccgtgat caacatccac agcgagacct ccgtgcccga 4 20 

ccatgtcgtc tggtccctgt tcaacaccct cttcttgaac tggtgctgtc tgggcttcat 4 80 

agcattcgcc tactccgtga agtctaggga caggaagatg gttggcgacg tgaccggggc 540 

ccaggcctat gcctccaccg ccaagtgcct gaacatctgg gccctgattc tgggcatcct 600 

catgaccatt ggattcatcc tgttactggt attcggctct gtgacagtct accatattat 660 

gttacagata atacaggaaa aacggggtta ctagtagccg cccatagcct gcaacctttg 720 

cactccactg tgcaatgctg gccctgcacc tggggctgtt gcccctgccc ccttggtcct 780 

gcccctagat acagcagttt atacccacac acctgtctac agtgtcattc aataaagtgc 840 

acgtgcttgt ga 852 

<210> 28 
<211> 125 
<212> PRT 

<213> Homo sapiens 
<400> 28 

Met His Lys Glu Glu His Glu Val Ala Val Leu Gly Ala Pro Pro Ser 

1 5 10 15 

Thr He Leu Pro Arg Ser Thr Val He Asn He His Ser Glu Thr Ser 
20 25 30 
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Val Pro Asp 


nlS 


vai 


Val 


Trp 


Ser 


35 










40 


Trp Cys Cys 


Leu 


biy 


Phe 


He 


Ala 


DO . 








55 




Asp Arg Lys 


Met 


val 


Gly Asp Val 


65 






70 






Thr Ala Lys 


Cys 


Leu 


Asn 


He 


Trp 






85 








Thr He Gly 


Phe 


He 


Leu 


Leu 


Leu 




100 


3 








His He Met 


Leu 


Gin 


He 


He 


Gin 


115 










120 



T m ^ 

Leu 


rue 


Asn 


Thr 


Leu 


Pne 


Leu 


Asn 










45 








rue 


AX a 


Tyr 


Ser 


Val 


Lys 


Ser 


TV 

Arg 








dO 










Thr 


ox y 


riJ. a 




211 A 

riXa 


iyr 


Ala 


OCX 






75 










80 


Ala 


Leu 


He 


Leu 


Gly 


He 


Leu 


Met 




90 










95 




Val 


Phe 


Gly 


Ser 


Val 


Thr 


Val 


Tyr 


105 










110 






Glu 


Lys 


Arg 


Gly 


Tyr 









125 



<210> 29 

<211> 1106 

<212> DNA 

<213> Homo sapiens 

<400> 29 

cagagctgct gtcatggcgg ccgctctgtg gggcttcttt cccgtcctgc tgctgctgct 60 

gctatcgggg gatgtccaga gctcggaggt gcccggggct gctgctgagg gatcgggagg 120 

gagtggggtc ggcataggag atcgcttcaa gattgagggg cgtgcagttg ttccaggggt 180 

gaagcctcag gactggatct cggcggcccg agtgctggta gacggagaag agcacgtcgg 240 

tttccttaag acagatggga gttttgtggt tcatgatata ccttctggat cttatgtagt 300 

ggaagttgta tctccagctt acagatttga tcccgttcga gtggatatca cttcgaaagg 360 

aaaaatgaga gcaagatatg tgaattacat caaaacatca gaggttgtca gactgcccta 420 

tcctctccaa atgaaatctt caggtccacc ttcttacttt attaaaaggg aatcgtgggg 480 

ctggacagac tttctaatga acccaatggt tatgatgatg gttcttcctt tattgatatt 540 

tgtgcttctg cctaaagtgg tcaacacaag tgatcctgac atgagacggg aaatggagca 600 

gtcaatgaat atgctgaatt ccaaccatga gttgcctgat gtttctgagt tcatgacaag 660 

actcttctct tcaaaatcat ctggcaaatc tagcagcggc agcagtaaaa caggcaaaag 720 

tggggctggc aaaaggaggt agtcaggccg tccagagctg gcatttgcac aaacacggca 780 

acactgggtg gcatccaagt cttggaaaac cgtgtgaagc aactactata aacttgagtc 840 

atcccgacgt tgatctctta caactgtgta tgttaacttt ttagcacatg ttttgtactt 900 

ggtacacgag aaaacccagc tttcatcttt tgtctgtatg aggtcaatat tgatgtcact 960 

gaattaatta cagtgtccta tagaaaatgc cattaataaa ttatatgaac tactatacat 1020 

tatgtatatt aattaaaaca tcttaatcca gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1080 

armaaamgcg ggcgcggggg cgasky 1106 

<210> 30 
<211> 242 
<212> PRT 

<213> Homo sapiens 



<400> 30 



Met Ala Ala 


Ala 


Leu 


Trp 


Gly 


Phe 


Phe 


Pro 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


1 




5 










10 










15 




Leu Ser Gly 


Asp 


Val 


Gin 


Ser 


Ser 


Glu 


Val 


Pro 


Gly 


Ala 


Ala 


Ala 


Glu 




20 










25 










30 






Gly Ser Gly 


Gly 


Ser 


Gly 


Val 


Gly 


He 


Gly 


Asp 


Arg 


Phe 


Lys 


He 


Glu 


35 










40 










45 








Gly Arg Ala 


Val 


Val 


Pro 


Gly 


Val 


Lys 


Pro 


Gin 


Asp 


Trp 


He 


Ser 


Ala 


50 








55 










60 










Ala Arg Val 


Leu 


Val 


Asp 


Gly 


Glu 


Glu 


His 


Val 


Gly 


Phe 


Leu 


Lys 


Thr 


65 






70 










75 










80 


Asp Gly Ser 


Phe 


Val 


Val 


His 


Asp 


He 


Pro 


Ser 


Gly 


Ser 


Tyr 


Val 


Val 






85 










90 










95 




Glu Val Val 


Ser 


Pro 


Ala 


Tyr 


Arg 


Phe 


Asp 


Pro 


Val 


Arg 


Val 


Asp 


He 
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100 105 110 



Thr 


Ser 


Lys 


Gly Lys 


Met 


Arg 


Ala 


Arg 


Tvr 


Val 


Asn 


Tyr 








115 








120 










125 


Ser 


Glu 


Val 


Val Arg Leu 


Pro 


Tyr 


Pro 


Leu 


Gin 


Met 


Lys 


otsj. OCX yjMy 




130 








135 










140 




Pro 


Pro 


Ser Tyr Phe 


j-ie 


Lys Arg Glu 


Ser 


Trp Gly Trp 


Tlnir 7i^r\ PHo 

1 llX CUV 


145 








1 c;a 
J. D (J 










155 






160 


Leu 


Met 


Asn 


Pro Met 


Val 


Met 


Met 


Met 


Val 


Leu 


Pro 


Leu 


Ijcu irns 








165 










170 










Val 


Leu 


Leu 


Pro Lys 


Val 


Val 


Asn 


Thr 


Ser 


Asp 


Pro 


Asp 


Mpf Atct Am 








180 








185 










190 


Glu 


Met 


Glu 


Gin Ser 


Met 


Asn 


Met 


Leu 


Asn 


Ser 


Asn 


His 


Glu Leu Pro 






195 








200 










205 




Asp 


Val 


Ser 


Glu Phe 


Met 


Thr 


Arg 


Leu 


Phe 


Ser 


Ser 


Lys 


Ser Ser Gly 




210 








215 










220 


Lys 


Ser 


Ser 


Ser Gly 


Ser 


Ser 


Lys 


Thr 


Gly 


Lys 


Ser 


Gly 


Ala Gly Lys 


225 








230 










235 






240 



Arg Arg 



<210> 31 

<211> 2795 

<212> DNA 

<213> Homo sapiens 

<400> 31 

ggagctgaat accctcccag gcacacacag gtgggacaca aataagggtt ttggaaccac 60 
tattttctca tcacgacagc aacttaaaat gcctgggaag atggtcgtga tccttggagc 120 
ctcaaatata ctttggataa tgtttgcagc ttctcaagct tttaaaatcg agaccacccc 180 
agaatctaga tatcttgctc agattggtga ctccgtctca ttgacttgca gcaccacagg 240 
ctgtgagtcc ccatttttct cttggagaac ccagatagat agtccactga atgggaaggt 300 
gacgaatgag gggaccacat ctacgctgac aatgaatcct gttagttttg ggaacgaaca 360 
ctcttacctg tgcacagcaa cttgtgaatc taggaaattg gaaaaaggaa tccaggtgga 420 
gatctactct tttcctaagg atccagagat tcatttgagt ggccctctgg aggctgggaa 4 80 
gccgatcaca gtcaagtgtt cagttgctga tgtataccca tttgacaggc tggagataga 540 
cttactgaaa ggagatcatc tcatgaagag tcaggaattt ctggaggatg cagacaggaa 600 
gtccctggaa accaagagtt tggaagtaac ctttactcct gtcattgagg atattggaaa 660 
agttcttgtt tgccgagcta aattacacat tgatgaaatg gattctgtgc ccacagtaag 720 
gcaggctgta aaagaattgc aagtctacat atcacccaag aatacagtta tttctgtgaa 780 
tccatccaca aagctgcaag aaggtggctc tgtgaccatg acctgttcca gcgagggtct 840 
accagctcca gagattttct ggagtaagaa attagataat gggaatctac agcacctttc 900 
tggaaatgca actctcacct taattgctat gaggatggaa gattctggaa tttatgtgtg 960 
tgaaggagtt aatttgattg ggaaaaacag aaaagaggtg gaattaattg ttcaagcatt 1020 
ccctagagat ccagaaatcg agatgagtgg tggcctcgtg aatgggagct ctgtcactgt 1080 
aagctgcaag gttcctagcg tgtaccccct tgaccggctg gagattgaat tacttaaggg 1140 
ggagactatt ctggagaata tagagttttt ggaggatacg gatatgaaat ctctagagaa 1200 
caaaagtttg gaaatgacct tcatccctac cattgaagat actggaaaag ctcttgtttg 1260 
tcaggctaag ttacatattg atgacatgga attcgaaccc aaacaaaggc agagtacgca 1320 
aacactttat gtcaatgttg cccccagaga tacaaccgtc ttggtcagcc cttcctccat 1380 
cctggaggaa ggcagttctg tgaatatgac atgcttgagc cagggctttc ctgctccgaa 1440 
aatcctgtgg agcaggcagc tccctaacgg ggagctacag cctctttctg agaatgcaac 1500 
tctcacctta atttctacaa aaatggaaga ttctggggtt tatttatgtg aaggaattaa 1560 
ccaggctgga agaagcagaa aggaagtgga attaattatc caagttactc caaaagacat 1620 
aaaacttaca gcttttcctt ctgagagtgt caaagaagga gacactgtca tcatctcttg 1680 
tacatgtgga aatgttccag aaacatggat aatcctgaag aaaaaagcgg agacaggaga 1740 
cacagtacta aaatctatag atggcgccta taccatccga aaggcccagt tgaaggatgc 1800 
gggagtatat gaatgtgaat ctaaaaacaa agttggctca caattaagaa gtttaacact 1860 
tgatgttcaa ggaagagaaa acaacaaaga ctatttttct cctgagcttc tcgtgctcta 1920 



wo 03/000012 



30 



PCTAJS02/19773. 



ttttgcatcc 
caacatgaag 
tgatatgttc 
cttgagaaaa 
atgcccatct 
acagtaactg 
cataatttgt 
actctatatt 
gttaggaaaa 
ttatyccctt 
taaagatgcc 
actttcaggt 
gtactggcat 
cccaatgtgg 
ttgtatagta 



tccttaataa 
gggtcatata 
aactggagac 
acaatgagct 
atgtcccttg 
ccatcaagat 
acatatgtaa 
tagawtgtta 
ttycacatta 
ycacaaaatt 
gggttttata 
actaaatacc 
ggtacggaga 
cctaaaaatg 
aagtgataat 



<210> 32 
<211> 647 
<212> PRT 

<213> Homo sapiens 



tacctgccat 
gtcttgtaga 
actatttatc 
gagaggcaga 
ctgtgagcaa 
gagagaactg 
aataaaatta 
aaawaamyag 
awatttgckg 
ttatycctat 
tttttataga 
tcaacctatg 
tgtttcacga 
caacttcttt 
ttctggaaww 



tggaatgata 
agcacagaaa 
tgtgcaaatc 
cttccctgaa 
gaagtcaaag 
gaggagttcc 
tgccatagca 
tgttgcytgg 
acagctgacc 
atagtttatt 
caaataataa 
gtataatggt 
agtttgttca 
ttattttctt 
aaaaa 



atttactttg 
tcaaaagtgt 
cttgatactg 
tgtattgaac 
taaaacttgc 
ttgatctgta 
agattgctta 
actattataa 
yttgtcatct 
gacaataatt 
gcaaagggag 
tgactgggtt 
tcagactcct 
ttgtaaatgt 



caagaaaagc 
agctaatgct 
ctcatcattc 
ytggaaagaa 
tgcctgaaga 
tatacaataa 
aaatagcaac 
tttaatgcat 
ttctyctatt 
tcaggttttg 
cactgggttg 
tctctgtata 
gtgcaacttt 
ttaggttttt 



1980 
2040 

2100 
2160 
2220 
2280 
2340 
2400 
24 60 
2520 
2580 
2640 
2700 
2760 
2795 



<400> 32 



Met 


Pro 


Gly 


Lys 


Met 


Val 


Val 


T 1 « 

He 


Leu 


Gly Ala 


Ser 


Asn 


He 


Leu 


Trp 


1 








5 










10 










15 




He 


Met 


rne 


Ala 


Ala 


Ser 


Cain 


Ala 


rne 


Lys 


He 


Glu 


1 nr 


Thr 


Pro 


Glu 








Z U 










ZD 










JU 






Ser 


Arg 


Tyr 


Leu 


Ala 


bin 


lie 


biy 


Asp 


Ser 


Val 


Ser 


Leu 


1 nr 


Cys 


Ser 
















H U 


















Thr 


Thr 




Cys 


VjIU 


Ser 


Pro 


rne 


DK A 

rne 


Ser 


Trp 


Arg 


Thr 


Gin 


He 


Asp 




50 




















60 










Ser 


Pro 




Asn 


Gly 


Lys 


Val 


Thr 


Asn 


Glu 


Gly 


Thr 


Thr 


Ser 


Thr 


Leu 


65 










70 










75 










80 


Thr 


Met 


Asn 


Pro 


Val 


Ser 


Phe 


Gly 


Asn 


Glu 


His 


Ser 


Tyr 


Leu 


Cys 


Thr 










85 










90 










95 




Ala 


Thr 


Cys 


Glu 


Ser 


Arg 


Lys 


Leu 


Glu 


Lys 


Gly 


He 


Gin 


Val 


Glu 


He 








100 










105 










110 






Tyr 


Ser 


Phe 


Pro 


Lys 


Asp 


Pro 


Glu 


He 


His 


Leu 


Ser 


Gly 


Pro 


Leu 


Glu 






115 










120 










125 








Ala 


Gly 


Lys 


Pro 


He 


Thr 


Val 


Lys 


Cys 


Ser 


Val 


Ala 


Asp 


Val 


Tyr 


Pro 




130 










135 










140 










Phe Asp 


Arg 


Leu 


Glu 


He 


Asp 


Leu 


Leu 


Lys 


Gly Asp 


His 


Leu 


Met 


Lys 


145 










150 










155 










160 


Ser 


Gin 


Glu 


Phe 


Leu 


Glu 


Asp 


Ala 


Asp 


Arg 


Lys 


Ser 


Leu 


Glu 


Thr 


Lys 










165 










170 










175 




Ser 


Leu 


Glu 


Val 


Thr 


Phe 


Thr 


Pro 


Val 


He 


Glu 


Asp 


He 


Gly 


Lys 


Val 








180 










185 










190 






Leu 


Val 


Cys 


Arg 


Ala 


Lys 


Leu 


His 


He 


Asp 


Glu 


Met 


Asp 


Ser 


Val 


Pro 






195 










200 










205 








Thr 


Val 


Arg 


Gin 


Ala 


Val 


Lys 


Glu 


Leu 


Gin 


Val 


Tyr 


He 


Ser 


Pro 


Lys 




210 










215 










220 










Asn 


Thr 


Val 


He 


Ser 


Val 


Asn 


Pro 


Ser 


Thr 


Lys 


Leu 


Gin 


Glu 


Gly Gly 


225 










230 










235 










240 


Ser 


Val 


Thr 


Met 


Thr 


Cys 


Ser 


Ser 


Glu 


Gly Leu 


Pro 


Ala 


Pro 


Glu 


He 










245 










250 










255 




Phe 


Trp 


Ser 


Lys 


Lys 


Leu 


Asp 


Asn 


Gly 


Asn 


Leu 


Gin 


His 


Leu 


Ser Gly 








260 










265 










270 






Asn 


Ala 


Thr 


Leu 


Thr 


Leu 


He 


Ala 


Met 


Arg 


Met 


Glu Asp Ser Gly 


He 






275 










280 










285 








Tyr 


Val 


Cys 


Glu 


Gly 


Val 


Asn 


Leu 


He 


Gly 


Lys 


Asn Arg 


Lys 


Glu 


Val 
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290 










295 




Glu 


Leu 


He 


Val 


Gin 


Ala 


Phe 


Pro 


305 










310 






Gly Gly 


Leu 


Val 


Asn Gly Ser 


Ser 










325 








Ser 


Val 


Tyr 


Fro 


Leu Asp Arg 


Leu 








34 0 










Thr 


He 


Leu 


r" 1 n 
(slU 


Asn 


J. j.e 


(jlU 


Pne 






c c 

355 










360 


Leu 


Glu 


Asn 


Lys 


Ser 


Leu 


Glu 


Met 




370 










375 




Thr Gly 


Lys 


Ala 


Leu 


val 


Cys 


Gin 


385 










•3 fi A 

390 






Glu 


Phe 


olU 


Pro 


Lys 


Gin 


Arg 


Gin 










405 








Val 


Ala 


Pro 


Arg 


Asp 


Thr 


Thr 


Val 








420 










Glu 


Glu 


Gly 


Ser 


Ser 


Val 


Asn 


Met 






435 










440 


Ala 


Pro 


Lys 


He 


Leu 


Trp 


Ser 


Arg 




450 










455 




Pro 


Leu 


Ser 


Glu 


Asn 


Ala 


Thr 


Leu 


465 










470 






Asp 


Ser 


Gly Val 


Tyr 


Leu 


Cys 


Glu 










485 








Arg Lys 


Glu 


Val 


Glu 


Leu 


He 


He 








500 










Leu 


Thr 


Ala 


Phe 


Pro 


Ser 


Glu 


Ser 






515 










520 


He 


Ser 


Cys 


Thr 


Cys 


Gly Asn 


Val 




530 










535 




Lys 


Lys 


Ala 


Glu 


T K -r- 

1 nr 


Gly 


Asp 


Thr 


545 










550 






Tyr 


Thr 


He 


Arg 


Lys 


Ala 


Gin 


Leu 










565 








Glu 


Ser 


Lys 


Asn 


Lys 


Val 


Gly 


Ser 








580 










Val 


Gin 


Gly Arg 


Glu 


Asn 


Asn 


Lys 






595 










600 


Val 


Leu 


Tyr 


Phe 


Ala 


Ser 


Ser 


Leu 




610 










615 




He 


Tyr 


Phe 


Ala 


Arg 


Lys 


Ala 


Asn 


625 










630 






Glu 


Ala 


Gin 


Lys 


Ser 


Lys 


Val 





645 



300 



Arg 


Asp 


Pro 


blU 


lie 


Glu 


Met 


Ser 






Jl D 










320 


vai 


i nr 


vai 


Ser 


Cys 


Lys 


Val 


Pro 














335 




m 1-1 

vjIU 


X'S a. 

lie 


(slU 


Leu 


Leu 


Lys 


Gly Glu 


"7 A 










Q C A 






Leu 


r*! in 
VjlU 


Asp 


fTI K V- 

1 nr 


Asp 


Met 


Lys 


Ser 










T ^ c 
ODD 








rp W 

1 nr 


rne 


lie 


Pro 


Thr 


He 


Glu Asp 








jO u 










Ala 


Lys 


Leu 


His 


lie 


Asp 


Asp Met 






"5 Q c. 

J y 0 










400 


ber 


inr 


bin 


Thr 


Leu 


Tyr 


Val 


Asn 




4 lU 










415 




Leu 


val 


Ser 


Pro 


Ser 


Ser 


He 


Leu 












430 






Tnr 


Cys 


Leu 


Ser 


Gin 


Gly 


Phe 


Pro 










445 








Gin 


Leu 


Pro 


Asn 


Gly Glu 


Leu 


Gin 








/I C A 

4 bU 










Thr 


Leu 


He 


Ser 


Thr 


Lys 


Met 


Glu 






Aid 

4 ID 










480 


Gly 


He 


Asn 


Gin 


Ala 


Gly 


Arg 


Ser 




^ yu 










4 95 




(jln 


Val 


Thr 


Pro 


Lys Asp 


He 


Lys 


DUO 










510 






val 


Lys 


Glu 


Gly 


Asp 


Thr 


Val 


He 










525 








Pro 


Glu 


Thr 


Trp 


He 


He 


Leu 


Lys 








54 0 










Val 


Leu 


Lys 


Ser 


He 


Asp 


Gly 


Ala 






c c c 

55 5 










560 


Lys 


Asp 


Ala 


Gly 


Val 


Tyr 


Glu 


Cys 




O /U 










575 




Gin 


Leu 


Arg 


Ser 


Leu 


Thr 


Leu Asp 


585 










590 






Asp 


Tyr 


Phe 


Ser 


Pro 


Glu 


Leu 


Leu 










605 








He 


He 


Pro 


Ala 


He 


Gly 


Met 


He 








620 










Met 


Lys 


Gly 


Ser 


Tyr 


Ser 


Leu 


Val 






635 










640 



<210> 33 
<211> 1375 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 1327, 1328, 1329, 1330, 
1337, 1338, 1339, 1340, 1341, 
1347, 1348, 1349, 1350, 1351, 
1357, 1358, 1359, 1360, 1361, 
<223> n = A,T,C or G 



1331, 1332, 1333, 1334, 1335, 1336, 

1342, 1343, 1344, 1345, 1346, 

1352, 1353, 1354, 1355, 1356, 

1362, 1363, 1364, 1365 
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<221> misc^feature 

<222> 1366, 1367, 1368, 1369, 1370, 1371, 1372 
<223> n = A,T,C or G 



<400> 33 

gagtcgaccc acgcgtccgc ccacgcgtcc gagcggtctg gacagcgcgt ggccggcgcc 60 

gctgtgggga cagcatgagc ggcggttgga tggcgcaggt tggagcgtgg cgaacagggg 120 

ctctgggcct ggcgctgctg ctgctgctcg gcctcggact aggcctggag gccgccgcga 180 

gcccgctttc caccccgacc tctgcccagg ccgcaggccc cagctcaggc tcgtgcccac 24 0 

ccaccaagtt ccagtgccgc accagtggct tatgcgtgcc cctcacctgg cgctgcgaca 300 

gggacttgga ctgcagcgat ggcagcgatg aggaggagtg caggattgag ccatgtaccc 360 

agaaagggca atgcccaccg ccccctggcc tcccctgccc ctgcaccggc gtcagtgact 420 

gctctggggg aactgacaag aaactgcgca actgcagccg cctggcctgc ctagcaggcg 480 

agctccgttg cacgctgagc gatgactgca ttccactcac gtggcgctgc gacggccacc 540 

cagactgtcc cgactccagc gacgagctcg gctgtggaac caatgagatc ctcccggaag 600 

gggatgccac aaccatgggg ccccctgtga ccctggagag tgtcacctct ctcaggaatg 660 

ccacaaccat ggggccccct gtgaccctgg agagtgtccc ctctgtcggg aatgccacat 720 

cctcctctgc cggagaccag tctggaagcc caactgccta tggggttatt gcagctgctg 780 

cggtgctcag tgcaagcctg gtcaccgcca ccctcctcct tttgtcctgg ctccgagccc 840 

aggagcgcct ccgcccactg gggttactgg tggccatgaa ggagtccctg ctgctgtcag 900 

aacagaagac ctcgctgccc tgaggacaag cacttgccac caccgtcact cagccctggg 960 

cgtagccgga caggaggaga gcagtgatgc ggatgggtac ccgggcacac cagccctcag 1020 

agacctgagc tcttctggcc acgtggaacc tcgaacccga gctcctgcag gaagtggccc 1080 

tggagattga gggtccctgg acactcccta tggagatccg gggagctagg atggggaacc 114 0 

tgccacagcc agaactgagg ggctggcccc aggcagctcc cagggggtag aacggccctg 1200 

tgcttaagac actcctgctg ccccgtctga gggtggcgat taaagttgct tcacatcctc 1260 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaargg gcggcccgct 1320 

agactannnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngaa 1375 

<210> 34 
<211> 282 
<212> PRT 

<213> Homo sapiens 
<400> 34 

Met Ser Gly Gly Trp Met Ala Gin Val Gly Ala Trp Arg Thr Gly Ala 

15 10 15 

Leu Gly Leu Ala Leu Leu Leu Leu Leu Gly Leu Gly Leu Gly Leu Glu 

20 25 30 

Ala Ala Ala Ser Pro Leu Ser Thr Pro Thr Ser Ala Gin Ala Ala Gly 

35 40 45 

Pro Ser Ser Gly Ser Cys Pro Pro Thr Lys Phe Gin Cys Arg Thr Ser 

50 55 60 

Gly Leu Cys Val Pro Leu Thr Trp Arg Cys Asp Arg Asp Leu Asp Cys 
65 70 75 80 

Ser Asp Gly Ser Asp Glu Glu Glu Cys Arg lie Glu Pro Cys Thr Gin 

85 90 95 

Lys Gly Gin Cys Pro Pro Pro Pro Gly Leu Pro Cys Pro Cys Thr Gly 

100 105 110 

Val. Ser Asp Cys Ser Gly Gly Thr Asp Lys Lys Leu Arg Asn Cys Ser 

115 120 125 

Arg Leu Ala Cys Leu Ala Gly Glu Leu Arg Cys Thr Leu Ser Asp Asp 

130 135 140 

Cys lie Pro Leu Thr Trp Arg Cys Asp Gly His Pro Asp Cys Pro Asp 
145 150 155 160 

Ser Ser Asp Glu Leu Gly Cys Gly Thr Asn Glu lie Leu Pro Glu Gly 

165 170 175 

Asp Ala Thr Thr Met Gly Pro Pro Val Thr Leu Glu Ser Val Thr Ser 
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180 185 190 



Leu 


Arg 


Asn 


a 1 ^ 
/iia 


Thr 


Thr Met 


biy 


Pro 


fro 


va J. 


Thr 


Leu 
205 


Glu 


Ser 


Val 


Pro 


Ser 


vajL 




Asn 


Ala Thr 


oer 


C r- 

oer 


oer 


MX a 


Gly Asp Gin 


Ser 


Gly 




210 








215 










220 










Ser 


Pro 


Tnr 


Ala 


Tyr Gly Val 


lie 


Ala 


A±a 


Ala 


Ala 


Val 


Leu 


Ser 


Ala 


225 










230 








235 










240 


Ser 


Leu 


Val 


Thr 


Ala 
245 


Thr Leu 


Leu 


Leu 


Leu 
250 


Ser 


Trp 


Leu 


Arg 


Ala 
255 


Gin 


Glu Arg 


Leu 


Arg 


Pro 


Leu Gly 


Leu 


Leu 


Val 


Ala 


Met 


Lys 


Glu 


Ser 


Leu 








260 








265 










270 






Leu 


Leu 


Ser 
275 


Glu 


Gin 


Lys Thr 


Ser 
280 


Leu 


Pro 















<210> 35 

<211> 1798 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 6, 7 

<223> n = A, T,C or G 
<400> 35 

ttcgannggc cgcccgggca ggtacctcaa atttttaggg gagggtgggt tagggactga 60 

tactcagatt gtggataata attgaattgg tttttaaagg caacatagca ttctacagca 120 

gggttaatct attatcaaga acagtcaccc tggttaataa caagttttac tgatcagttg 180 

ctggttggtt ggttggttgg catgtgggtg tgtgggtgta taggtgtgtg tgggtgtgtg 240 

tgtgtatttt tccccatgag tccttttttt aatcctgtgg ctttttcact tacaactagc 300 

ctaaccctgt aattttccta catccaagaa aacaatcaca aagtagtggt ttaaatactt 360 

tgttgtattt ggctaatttt gctgtcttaa tgcagcctat taagagttgg gttaaaaatc 420 

agtaatcagt actttattac atcactgaac taaaatatgg agacatcctc attgaaaatg 480 

gagggcactc tatcagtcta taactatcaa cgtagtgcaa cagggtgttt tgataccttt 540 

gttttcacct cttgacataa tgctatttaa aggcttgaat ttttcccttt atataatttt 600 • 

cacctttact ttcaaagtgt tttgttgtag ttggctattg cagagagtgc attgtcctat 660 

cattcctaaa cctggtctgc tttctacatt catggtatgg aaaccatgtg attctttgta 720 

cagtttatcc tgatgttgct tgtaatgcag tagaggctat ttcgccttcg cttttctttc 780 

tcgacctttt tgtaaaccct ataattatga agcgattgct tgagaaaata acatataaac 840 

atagaataga atagactgac caagatggtt cacagtttct ttttttaact aggttattta 900 

taatgtattt ctgaaccact tggcagacaa attcacaaca cttaatgttc atattttgag 960 

taaaggaagc taaaaccatg tttgctttct ggtactacat gcattagcga aaggttaagt 1020 

aagttttgtt ctccactgaa gtaatactta acatctcaga aaaaattttg catgttctgt 1080 

agttttgtat taaatcagtc atttcatatg cactatatca agtacaaaca ggtagtttac 1140 

ctgtttatag tagtgtacta acaaagtctc ccttgcagct tcagactgtt atctataggc 1200 

ttatcgttca aatacagcac ttgaatatcc caagtagttc ttctacgcat agctcacctt 1260 

tctaaaccca gttaagcatg gaagagaggt agtaggtagg tgcagtgtgt ggaagctgca 1320 

aacaagtagg ccttttattc attgatatct tttcccaagt actggatttt aaatctgwat 1380 

gtatctgttt gatttttttt tctaatattt cagttgagct gctgttttct tccatgcaat 1440 

attgtatact caattgtgta tagaagaagc tggtgagagt gccctcctac ataaataagc 1500 

aattgcagtg ttttgcatgc aaaatataaa aaatttaaat tgtcctgatt ctattttgta 1560 

aatggagaaa caatcatatc tttctaagcg gtaatggagg aagactagtg ctttgtgcat 1620 

tttgatatat ttgagttcat tttttccaca atgtcatact tttgacgcag ttgggtttct 1680 

catargtatc ctagttcatg tacatccgaa tgctaaataa tactgtgttt taagttttgt 1740 

gttgcaagaa caaatggaat aaacttgaat tgtgctacaa aaaaaaaaaa aaaaaaaa 1798 



<210> 36 
<211> 57 
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<212> PRT 

<213> Homo sapiens 

<400> 36 

Met Leu Phe Lys Gly Leu Asn Phe Ser Leu Tyr He He Phe Thr Phe 

15 10 15 

Thr Phe Lys Val Phe Cys Cys Ser Trp Leu Leu Gin Arg Val His Cys 

20 25 30 

Pro He He Pro Lys Pro Gly Leu Leu Ser Thr Phe Met Val Trp Lys 

35 40 45 

Pro Cys Asp Ser Leu Tyr Ser Leu Ser 
50 55 



<2.10> 37 

<211> 3113 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 68, 92, 94, 106, 145 

<223> n = A,T,C or G 

<400> 37 

gacccacgms yccgcgtcgt ccgcgcgtcg 
gcctcgcntg gtccgaattc ggtggcgcca 
gcggcttcgg cggcttccac ctagnacacc 
ggggtcggca cggggagtcg ggcggtcttg 
tcggacatcg gagactggtt caggagcatc 
accgtcgccg tgcccttggt cggcaaactc 
tggcccgaag ccttccttta tcgctttcag 
ttccctgtgg gtccaggaac tggatttctt 
tattctacgc gacttgaaac aggagctttt 
ctcctcttta actggatttg catcgtgatt 
atgattcctc tgatcatgtc agtactttat 
gtatcatttt ggtttggaac acgatttaag 
ttcaactata tcatcggagg ctcggtaatc 
ctttattttt tcctaatgtt cagataccca 
acacctcagt ttttgtaccg ctggctgccc 
gtgccccctg ctagcatgag gcgagctgct 
tggggccagg gctttcgact tggagaccag 
tcaagccaca tttcctccca gtgctgggtg 
ttggacctga cccacactga atgtagtctt 
aagaaaaata taagtgttcc acaagtttca 
agaacaaata ccaactgtgc aaattgcaaa 
tcccctttcc gtctgaataa tgggttttag 
gctgggtcac caaacccttc ccaaaaggac 
tcccactttt cccaaccccc acatttgcaa 
cccttgacag gttctgttat ttattgactt 
cacgtaattt tccccctttg gtggcagaac 
gatgaagcgt tttctcagct tttggaattg 
ccacttcttc agatattttt ataaaaaagt 
tattaatgag atacgagggt tgttgctggg 
tgtagtggag ttgcagctaa catgggttag 
cgtttcatat gtaggcctac tggctttgtg 
ggaggatcca gatcatgttg gctacaggga 
gattccattt caatctcatt ctggatatgt 
tacgctattt aaatgtcact tttttgccta 



ccggaagggg aagtttcgcc tcagaaggct 60 
cngntccgcc cgtctnccgc cttctgcatc 120 
taacagtcgc ggagccggcc gcgtcgtgag 180 
tgcatcttgg ctacctgtgg gtcgaagatg 240 
ccggcgatca cgcgctattg gttcgccgcc 300 
ggcctcatca gcccggccta cctcttcctc 360 
atttggaggc caatcactgc caccttttat 420 
tatttggtca atttatattt cttatatcag 480 
gatgggaggc cagcagacta tttattcatg 540 
actggcttag caatggatat gcagttgctg 600 
gtctgggccc agctgaacag agacatgatt 660 
gcctgctatt taccctgggt tatccttgga 720 
aatgagctta ttggaaatct ggttggacat 780 
atggacttgg gaggaagaaa ttttctatcc 840 
agtaggagag gaggagtatc aggatttggt 900 
gatcagaatg gcggaggcgg gagacacaac 960 
tgaaggggcg gcctcgggca gccgctcctc 1020 
cgcttaacaa ctgcgttctg gctaacactg 1080 
tcagtacgag acaaagtttc ttaaatcccg 1140 
cgattctcat tcaagtcctt actgctgtga 1200 
actgactaca ttttttggtg tcttctcttc 1260 
cgggtcctag tctgctggca ttgagctggg 1320 
ccttatctct ttcttgcaca catgcctctc 1380 
ctagaagagg ttgcccataa aattgctctg 1440 
ttgccaaggc ttggtcacaa caatcatatt 1500 
tgtagcaata gggggagaag acaagcagcg 1560 
cttcgacctg acatccgttg taaccgtttg 1620 
accactgagt cagtgagggc cacagattgg 1680 
tgtttgtttc ctgagctaag tgatcaagac 1740 
gtttaaacca tgggggatgc aacccctttg 1800 
tagctggagt agttgggttg ctttgtgtta 1860 
gatgctctct ttgagaggct cctgggcatt 1920 
gttcattgag taaaggagga gagaccctca 1980 
tcccccgttt tttggtcatg tttcaattaa 2040 
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ttgtgaggaa ggcgcagctc ctctctgcac gtagatcatt ttttaaagct aatgtaagca 2100 
catctaaggg aataacatga tttaaggttg aaatggcttt agaatcattt gggtttgagg 2160 
gtgtgttatt ttgagtcatg aatgtacaag ctctgtgaat cagaccagct taaataccca 2220 
cacctttttt tcgtaggtgg gcttttccta tcagagcttg gctcataacc aaataaagtt 2280 
ttttgaaggc catggctttt cacacagtta ttttatttta tgacgttatc tgaaagcaga 2340 
ctgttaggag cagtattgag tggctgtcac actttgaggc aactaaaaag gcttcaaacg 2400 
ttttgatcag tttcttttca ggaaacattg tgctctaaca gtatgactat tctttccccc 2460 
actcttaaac agtgtgatgt gtgttatcct aggaaatgag agttggcaaa caacttctca 2520 
ttttgaatag agtttgtgtg tacctctcca tatttaattt atatgataaa ataggtgggg 2580 
agagtctgaa ccttaactgt catgttttgt tgttcatctg tggccacaat aaagtttact 2640 
tgtaaaattt tagaggccat tactccaatt atgttgcacg tacactcatt gtacaggcgt 2700 
ggagactcat tgtatgtata agaatattct gacagtgagt gacccggagt ctctggtgta 2760 
ccctcttacc agtcagctgc ctgcgagcag tcattttttc ctaaaggttt acaagtattt 2820 
agaactcttc agttcagggc aaaatgttca tgaagttatt cctcttaaac atggttagga 2880 
agctgatgac gttattgatt ttgtctggat tatgtttctg gaataatttt accaaaacaa 2940 
gctatttgag ttttgacttg acaaggcaaa acatgacagt ggattctctt tacaaatgga 3000 
aaaaaaaaat ccttattttg tataaaggac ttcccttttt gtaaactaat cctttttatt 3060 
ggtaaaaatt gtaaattaaa atgtgcaact tgaaaaaaaa aaaaaaaaaa aaa 3113 

<210> 38 
<211> 251 
<212> PRT 

<213> Homo sapiens 
<400> 38 



LJCL. OVL. 


r\Sp 


T 1 » 


oxy 


Asp 


Trp 


rne 


Arg 


Ser 


He 


Pro 


Ala 


He 


Thr Arg 


1 






5 










10 










15 


Tyr Trp 


Phe 


Ala 


Ala 


Thr 


Val 


Ala 


Val 


Pro 


Leu 


Val 


Gly 


Lys 


Leu Gly 






20 










25 










30 


Leu lie 


Ser 


Pro 


Ala 


Tyr 


Leu 


Phe 


Leu 


Trp 


Pro 


Glu 


Ala 


Phe 


Leu Tyr 




35 










40 










45 




Arg Phe 


Gin 


He 


Trp 


Arg 


Pro 


He 


Thr 


Ala 


Thr 


Phe 


Tyr 


Phe 


Pro Val 


50 










55 










60 








Gly Pro 


Gly 


Thr 


Gly 


Phe 


Leu 


Tyr 


Leu 


Val 


Asn 


Leu 


Tyr 


Phe 


Leu Tyr 


65 








70 










75 • 






80 


Gin Tyr 


Ser 


Thr 


Arg 
85 


Leu 


Glu 


Thr 


Gly 


Ala 
90 


Phe 


Asp 


Gly 


Arg 


Pro Ala 
95 


Asp Tyr 


Leu 


Phe 
100 


Met 


Leu 


Leu 


Phe 


Asn 
105 


Trp 


He 


Cys 


He 


Val 
110 


He Thr 


Gly Leu 


Ala 
115 


Met 


Asp 


Met 


Gin 


Leu 
120 


Leu 


Met 


He 


Pro 


Leu 

125 


He 


Met Ser 


Val Leu 


Tyr 


Val 


Trp 


Ala 


Gin 


Leu 


Asn 


Arg 


Asp 


Met 


He 


Val 


Ser Phe 


130 










135 










140 








Trp Phe 


Gly 


Thr 


Arg 


Phe 


Lys 


Ala 


Cys 


Tyr 


Leu 


Pro 


Trp 


Val 


He Leu 


145 








150 










155 








160 


Gly Phe 


Asn 


Tyr 


He 
165 


He 


Gly 


Gly 


Ser 


Val 
170 


He 


Asn 


Glu 


Leu 


He Gly 
175 


Asn Leu 


Val 


Gly 
180 


His 


Leu 


Tyr 


Phe 


Phe 
185 


Leu 


Met 


Phe 


Arg 


Tyr 
190 


Pro Met 


Asp Leu 


Gly 


Gly 


Arg 


Asn 


Phe 


Leu 


Ser 


Thr 


Pro 


Gin 


Phe 


Leu Tyr Arg 




195 










200 










205 






Trp Leu 


Pro 


Ser 


Arg 


Arg 


Gly 


Gly 


Val 


Ser Gly 


Phe 


Gly 


Val 


Pro Pro 


210 










215 










220 








Ala Ser 


Met 


Arg 


Arg 


Ala 


Ala 


Asp 


Gin 


Asn 


Gly 


Gly 


Gly 


Gly Arg His 


225 








230 










235 








240 


Asn Trp 


Gly 


Gin 


Gly 
245 


Phe 


Arg 


Leu 


Gly 


Asp 
250 


Gin 
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<210> 39 

<211> 3599 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 3390, 3420 
<223> n A,T,C or G 

<400> 39 

cagtttggga ccaaagccaa agataaccag gttcatatta attacacgga ataggcaaga 60 
aagcatgagc cctgaggagg aaggaaaggg actgtcccag gtgtacttac ctcaaagatg 120 
aagaaatatc aaagacagga aaccctaggt tcttgccctt cagtcgctat ctccttgccc 180 
attagtaaaa tgcggccgat gaatgtcctc acttctgtcc atctgggcag gaggtgggaa 240 
gggtgacgtg caaatggatg ggaggaaccc ttttttcggc agcacccacc acacccagcc 300 
tagtgccacg caccgcaagc gctccataaa cgcacacagc gtcgcswcya csmgkmyscc 360 
gggcggcctt cgcgggattt ctcctggcgt cggctttcag actcccgagg gtgggataaa 4 20 
tcgagagggt ggcatccttt ggcttttctt ctcccaggca gctctgaacc atgtttatgc 480 
aacgtttaat gggctctaat aaaacggcta ataattttga tccgcggaag caccgactcg 540 
ctcgctaagc cgagtctgcg agggtgaagc tgcaactcca acgccggaaa gcgcggctac 600 
cgaaaagcgc atgcgccacg gggtggcacg aagctagagt aagctgagga ggtgggcgga 660 
aaccatggca accatgggtg atgacgacat .ggggagcgtc tctagcgctg gattatgacg 720 
ctggattatg acgcatgcag tgggcgcccg ctctgcggtt cgcttgactg acggcgcagc 780 
ctccgggcct agccacagca gcaacggcag aggccagcgg gcgaggtcaa gatggtggct 840 
ccgcgggcgg gggaggcagt ggagggagga ggagtcagac cttagccagc cggaaacacc 900 
gaaacccaga gacctcctgg ggagccgtcg ccgccgccgc cctctcggcc atcgctgcct 960 
ccgccgcctg ctccacctcg agggacgcga gcgggcggcg gggctggccg tgagagagac 1020 
aggagaggaa ggagggcagg ggcggagttg cccgccttag cccccgcccc cggccgcggc 1080 
cccgggccct gccccgcgcg gccctgcccg gcccaccgag ccctggtgtg gcagcggctc 1140 
atggcggccg tggggccccc gcagcagcag gtgcggatgg cccatcagca ggtctgggcg 1200 
gcgctcgaag tggcgctccg ggtgccctgc ctttacatca tcgacgccat cttcaactcc 1260 
tacccggatt ccagccaaag ccggttctgc atcgtgctcc agatcttcct ccggctcttt 1320 
ggtgtatttg catccagtat tgttctgatc ttgtcacaac gatcactttt caagttttac 1380 
acgtacagct cagcctttct gttagctgca acttcagtgt tggtgaatta ttatgcttct 1440 
ttgcacattg acttctatgg tgcctacaac acgtcagctt ttggaattga gctgcttcct 1500 
cgaaaaggtc cctcgctgtg gatggcactt atcgttctac agctaacatt tggaattgga 1560 
tacgttacac tactccagat tcattccatc tattcacaat taattatttt ggatctcttg 1620 
gttcctgtaa taggcttaat cacagagcta ccattacaca tcagagagac tttactgttt 1680 
acttcttcct tgattctcac attaaataca gtgtttgtcc tggcagtgaa actgaagtgg 1740 
ttttattatt ccacacgata tgtttatctt ttggtgaggc acatgtatcg aatttatgga 1800 
ttacagttat tgatggagga cacatggaag aggattcgtt tcccagacat actacgagtc 1860 
ttttggctaa caagagttac agctcaggct acagtgttaa tgtacatctt aaggatggca 1920 
aatgaaactg attccttctt tatttcttgg gatgattttt gggacctcat ttgcaatctt 1980 
ataattagtg ggtgcgattc tacactaact gtactgggca tgagtgctgt aatttcctca 2040 
gtagcccatt atttggggct tggaatattg gcctttattg gatcaactga ggaagatgac 2100 
aggcgtcttg gctttgttgc acctgtttta ttttttattt tggctcttca gactgggtta 2160 
agtgggctaa gaccagaaga gagacttatt cgcttaagta gaaacatgtg ccttttatta 2220 
actgcagtcc tgcattttat ccatggaatg acagaccctg tattaatgtc tctcagtgcc 2280 
tctcatgtgt catct'tttcg tagacatttt cctgtgctgt ttgtctctgc ttgcctgttt 2340 
attcttcctg tcttactcag ttatgttctt tggcatcact atgcactaaa tacatggttg 2400 
tttgcagtta cagcattttg tgtggaactg tgcttaaaag taattgtttc tctcactgtt 24 60 
tatacgttat tcatgattga tggctactat aatgtcctct gggaaaagct tgacgattat 2520 
gtctactacg ttcgttcaac aggcagtatt attgaattta tatttggagt tgtaatgttt 2580 
ggaaatgggg cttacactat gatgtttgag tcgggaagta aaattcgggc ttttatgatg 2640 
tgcctacatg catattttaa catctactta caagccaaaa atggctggaa gacatttatg 2700 
aatcgtagga ctgctgtgaa gaaaattaat tcacttcctg aaataaaagg gagccgctta 27 60 
caagaaataa atgatgtatg tgcaatctgc tatcatgagt ttacaacatc tgctcgtatt 2820 
acaccgtgta atcattattt ccatgcactt tgccttcgga aatggctgta cattcaagat 2880 
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acttgtccaa tgtgccatca gaaagtatac atcgaagatg atatcaagga taattcaaat 2940 
gtatctaaca acaatggatt tattccaccc aatgaaactc cagaggaagc tgtaagagaa 3000 
gctgctgctg aatctgacag ggaattgaac gaagatgaca gtacagattg tgatgatgat 3060 
gttcaaagag aaagaaatgg agtgattcag cacacaggcg cagcagctga agaatttaat 3120 
gatgatactg actgatgaaa atagcattta ttaatgattg aggtatttgt ttaaaattca 3180 
gttcatccaa aatggagtaa tatccttcac cttcagtgtg taaccaagca caaaaacagt 3240 
atcaatgttg aatctgtgaa tggttttccg tttactgtga tgtgctactg taaatatacc 3300 
tctttaatta cttctggtct ctttggtgac ctgtttaaat ttgtgtacat tattgtacat 3360 
agaataaaat gttttcacat ttttatgacn aaaawwwraa caaatagctt tttaatagan 3420 
tgtaatgatc atatggtgcg tcacctgtgc caaatattct tcaatgaaat tatataatgt 3480 
aactttggac ctcagttttt ctttagaaat gggtgggaga atgaaaatgc aaatcaggaa 3540 
accacattaa agtcaaggaa ataaaataat ttgaccagag gataaaggac atgagagag 3599 

<210> 40 

<211> 664 

<212> PRT 

<213> Homo sapiens 



<400> 40 



Met 


Ala 


Ala 


Val 


Gly 


Pro 


Pro 


Gin 


Gin 


Gin 


Val 


Arg 


Met 


Ala 


His 


Gin 


1 








5 










10 










15 




Gin 


Val 


Trp 


Ala 


Ala 


Leu 


Glu 


Val 


Ala 


Leu 


Arg 


Val 


Pro 


Cys 


Leu 


Tyr 








20 










25 










30 






He 


He 


Asp 


Ala 


He 


Phe 


Asn 


Ser 


Tyr 


Pro 


Asp 


Ser 


Ser 


Gin 


Ser 


Arg 






35 










40 










45 








Phe 


Cys 


He 


Val 


Leu 


Gin 


He 


Phe 


Leu 


Arg 


Leu 


Phe 


Gly 


Val 


Phe 


Ala 




50 










55 










60 










Ser 


Ser 


He 


Val 


Leu 


He 


Leu 


Ser 


Gin 


Arg 


Ser 


Leu 


Phe 


Lys 


Phe 


Tyr 


c c 
DO 










70 










75 










80 


Thr 


Tyr 


Ser 


Ser 


Ala 


Phe 


Leu 


Leu 


Ala 


Ala 


Thr 


Ser 


Val 


Leu 


Val 


Asn 










85 










90 










95 




Tyr 


Tyr 


Ala 


Ser 


Leu 


His 


He 


Asp 


Phe 


Tyr 


Gly 


Ala 


Tyr 


Asn 


Thr 


Ser 








1 no 

1 U u 










J. u o 










1 1 u 






Ala 


Phe 


Gly 


He 


Glu 


Leu 


Leu 


Pro 


Arg 


Lys 


Gly 


Pro 


Ser 


Leu 


Trp 


Met 






115 










120 










125 








Ala 


Leu 


He 


Val 


Leu 


Gin 


Leu 


Thr 


Phe 


Gly 


He 


Gly 


Tyr 


Val 


Thr 


Leu 




130 










135 










140 










Leu 


Gin 


He 


His 


Ser 


He 


Tyr 


Ser 


Gin 


Leu 


He 


He 


Leu 


Asp 


Leu 


Leu 


145 










150 










155 










160 


Val 


Pro 


Val 


He 


Gly 


Leu 


He 


Thr 


Glu 


Leu 


Pro 


Leu 


His 


He 


Arg 


Glu 










165 










170 










175 




Thr 


Leu 


Leu 


Phe 


Thr 


Ser 


Ser 


Leu 


He 


Leu 


Thr 


Leu 


Asn 


Thr 


Val 


Phe 








180 










185 










190 






Val 


Leu 


Ala 


Val 


Lys 


Leu 


Lys 


Trp 


Phe 


Tyr 


Tyr 


Ser 


Thr 


Arg 


Tyr 


Val 






195 










200 










205 








Tyr 


Leu 


Leu 


Val 


Arg 


His 


Met 


Tyr 


Arg 


He 


Tyr 


Gly 


Leu 


Gin 


Leu 


Leu 




210 










215 










220 










Met 


Glu 


Asp 


Thr 


Trp 


Lys 


Arg 


He 


Arg 


Phe 


Pro 


Asp 


He 


Leu 


Arg 


Val 


225 










230 










235 










240 


Phe 


Trp 


Leu 


Thr 


Arg 


Val 


Thr 


Ala 


Gin 


Ala 


Thr 


Val 


Leu 


Met 


Tyr 


He 










245 










250 










255 




Leu 


Arg 


Met 


Ala 


Asn 


Glu 


Thr 


Asp 


Ser 


Phe 


Phe 


He 


Ser 


Trp 


Asp 


Asp 








260 










265 










270 




Phe 


Trp 


Asp 


Leu 


He 


Cys 


Asn 


Leu 


He 


He 


Ser 


Gly 


Cys 


Asp 


Ser 


Thr 






275 










280 










285 








Leu 


Thr 


Val 


Leu 


Gly 


Met 


Ser 


Ala 


Val 


He 


Ser 


Ser 


Val 


Ala 


His 


Tyr 




290 










295 










300 










Leu 


Gly 


Leu 


Gly 


He 


Leu 


Ala 


Phe 


He 


Gly 


Ser 


Thr 


Glu 


Glu 


Asp 


Asp 



305 310 315 320 
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AxQ ftiy Jj6U 




t ne 


vai 


a 1 a 


Pro 


vai 


Leu 


rne 


rne 


He 


Leu 


Ala 
335 


Leu 


fZ^n Thr Glv 
uJ.ri iiii uxy 






oxy 


Leu 


Arg 


Pro 


vjXU 


oXU 


Arg 


Leu 


lie 


Arg 


Leu 




^ *i 




















•3 A 




Ser Arg Asn 


Met 


Cys 


Leu 


Leu 


Leu 


Thr 


Ala 


Val 


Leu 


His 


Phe 


He 


His 


J ■J ^ 










J DU 










c; 








oJ.y ijcu 1111. 




t r L> 


V a X 


Leu 


Mo-h 
lYie L 


o6r 


Leu 


Cor- 

oer 


Axa 


oer 


His 


Val 


Ser 










J / 0 










JoU 










Ph^ Arrr 

JKSl. JTllC r\^\^ 


rti y 


nx o 




D 

r X o 


Va 1 
VaX 


Leu 


Pha 


\/a 1 
V clX 


Ser 


2i 1 IS 

AX a 


L.ys 


T Mil 

lieu 


Phe 








T on 










J^D 










A f\f\ 

400 


A. JuX^ UsSsA ST X. \J 


V ax 


T .on 
fi U D 


Leu 


OCX 


lyr 


Va 1 

VAX 


JL>eu 


irp 


nxs 


nxs 


Tyr 


Ala 
415 


Leu 


&Gn Thy* Tyn 




Pha 


riXa 


VaX 


Thy* 

1 nr 


Hxa 


Dho 

rne 


Cys 


vax 


ulu 


lieu 


Cys 


Leu 




fi z u 










y! 0 










430 




Liy^ vcix xxc 


V ax 


OCX 


Leu 


Th y- 
1 ilx 


Va 1 
VaX 


Tyr 


Th -r 

i nr 


Leu 


Dh A 

rne 


Met 


lie 


Asp 


Gly 












/I zi n 

fi fi u 










/ /I c 
4 4 3 






Tyr Tyr Asn 


Val 


Leu 


Trp 


Glu 


Lys 


Leu 


Asp 


Asp 


Tyr 


Val 


Tyr 


Tyr 


Val 










/4 










>i c r» 










T^'T'rt Qqv T'Wv* 


\3±y 


Ser 


xxe 


xxe 


bXU 


rne 


lie 


rne 


Caly 


Val 


Val 


Met 


Phe 


ft DO 






Airs 
4 / u 










4 /O 










480 


\3±y rvon vjxy 


AX a 


Tyr 


i nr 




Met 


rne 


blU 


Ser 


Caly 


Ser 


Lys 


lie 


Arg 






0 0 










>l Q A 

4 yu 










4 95 


M.±a cne LTicL 


lYie t 
c»nn 


^ys 


Leu 


nxS 


AX a 


Tyr 
c; n c: 


rne 


Asn 


lie 


Tyr 


Leu 
510 


Gin 


Ala 


Lys Asn Gly 


Trp 


Lys 


Thr 


Phe 


Met 


Asn 


Arg 


Arg 


Thr 


Ala 


Val 


Lys 


Lys 


SI s 

3 ± 3 










son 










o c 






lie Asn Ser 


Leu 


Pro 


Glu 


He 


Lys 


Gly 


Ser 


Arg 


Leu 


Gin 


Glu 


He 


Asn 


0 J u 








R 1 c; 










540 










riSp Va± V^y& 


riXa 


ixe 


Cys 


Tyr 


nXS 


bXU 


rne 


<T< W> V 

inr 


Tnr 


Ser 


Ala 


Arg 


He 


OH D 






A 










c c c 

555 








560 


TVi'^ f^\t^ 

mz iTiO ^.^ys 


Asn 


nxS 
0 OD 


Tyr 


rne 


HIS 


Ala 


Leu 

0/0 


Cys 


Leu 


Arg 


Lys 


Trp 
575 


Leu 


xxc 


oou 


1 ilx 




irx O 


Met 


Cys 
585 


nxs 


tjxn 


Lys 


vai 


Tyr 
590 


lie 


Glu 


Asp Asp lie 


Lys 


Asp 


Asn 


Ser 


Asn 


Val 


Ser 


Asn 


Asn 


Asn 


Gly 


Phe 


He 












600 










dUD 








Pro Pro Asn 


Glu 


Thr 


Pro 


V3X U 


Glu 


Ala 


Va 1 
VaX 


Ax y 


oX u. 


a 1 a 
AX a 


2i 1 a 

Axa 


AX a 


r* 1 11 
ialU 


610 








615 










620 










Ser Asp Arg 


Glu 


Leu 


Asn 


Glu 


Asp Asp 


Ser 


Thr 


Asp 


Cys 


Asp 


Asp 


Asp 


625 






630 










635 










640 


Val Gin Arg 


Glu 


Arg 
645 


Asn 


Gly 


Val 


He 


Gin 
650 


His 


Thr 


Gly 


Ala 


Ala 
655 


Ala 


Glu Glu Phe 


Asn 


Asp 


Asp 


Thr 


Asp 



















660 



<210> 41 
<211> 2080 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 34, 85, 95 
<223> n = A, T,C or G 

<400> 41 

cgaccccgcs tccrcmgssr rkkgcgtccg cggnggcgcg gggagagtag ggtgctgtgg 60 
tctgagctag agggtgaagc tggcnggagc agganggatg ggcgagcagt ctgaatgcca 120 
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gaatggataa 
ccatctacat 
aagaaaattc 
atgaaaagac 
ggtgtatcac 
ttgatgtggt 
ttgatcccgg 
agttcctttt 
gtgcttgcat 
caggtctgtg 
agaaactaga 
gtgtctctgc 
accggaaaga 
tttacaattg 
gtttttaatt 
cattcaccac 
gccatctttc 
caaatttatg 
tgttagactt 
gtttctttgg 
cagtcaacca 
tatctaagat 
gctgtttgta 
attttaactt 
aactctccgt 
acataccagg 
aattcctcaa 
tgaccctcca 
ccaactatgg 
cctggtgtgt 
ctataaacat 
attaaaaaaa 
aaattaaaaa 



ccgttttgct 
ggcagcctcc 
cagtgatttg 
ttataatgat 
catacccaaa 
cacaaaatgt 
aaaccacaat 
accttttgtg 
ttgccgaagc 
tacactgggc 
gctccctgac 
tcccttacag 
gtacacctta 
ccatttttat 
tttatttgtg 
taaatacaca 
aaataaacta 
aaagaaaatt 
ttcagtaaaa 
gaaacctctc 
gcagtattag 
cattttccct 
tctgcactgt 
tgtaactgca 
aagattttca 
gttattaaaa 
ctgctattta 
gagctgggaa 
tctttatttc 
atcaacatta 
tgaactttaa 
tgtaacttca 
tgaaaagaga 



acagcatttg 
attggcacag 
aataaaagca 
gcactttttc 
aacatgcatt 
gtgagtttca 
agcgggattg 
agtttaggtt 
ttatatccca 
tcagtaagtt 
aatgtatccg 
ttcatggctt 
atgaaggcat 
ttttttaaaa 
gatataccat 
cttaatacca 
atctaggtct 
gggagtagga 
ctaaagtaac 
atcgtcaaaa 
tgctgttttc 
aagcattgac 
gtctgtctac 
agaattccag 
gtaccaccaa 
gtgtgctttc 
tttattccag 
acctaccaca 
ttgtggtaat 
tttaatgtct 
aaaacttatt 
taatttctta 
aaaaaaaaaa 



taattgcttg 
acttctggta 
tctgggatga 
gatacaatgg 
ggtatagccc 
cactaactga 
atctccttag 
tgatgtgctt 
ccattgccac 
gttatgttgc 
gtgaatttgg 
ctgctctctt 
atcgtgtggc 
taatactgat 
tttattatga 
ctaaaattta 
agacagaaag 
atgtgaccga 
tgtatctgtt 
ctttatgttc 
aaagatttaa 
acatagcttc 
aaaaagtgaa 
ttcagccggg 
attgttttgg 
tttttacatt 
cccagtacta 
agatctaaag 
atgatgtgcc 
tdtaattcag 
tatttattcc 
ccataacctc 
aaaaaaaaac 



tgtgcttagc 
tgaatatcga 
attcattagt 
cacagtggga 
accagaaagg 
gcagttcatg 
gacctatctt 
tggggctttg 
gggcattctc 
tggaattgaa 
atggtccttc 
catctgggct 
atgagcaaga 
attttcccca 
aaatctattt 
tgtggtttac 
aaatggatag 
aaacaagttg 
caactaaaaa 
actttgctgt 
gctctataaa 
atctgaggtg 
aaatacagtg 
cgaggattag 
attttttttc 
atattacagt 
taaagaacgt 
ttctggctgt 
tttccttgcc 
tcattttttt 
actactgtag 
aatgtctttt 



ctcatttcca 
agtccagttc 
gatgaggcag 
ttgtggagac 
acagagtcat 
gagaaatttg 
tggcgttgcc 
atcggacttt 
catctccttg 
ctactccacc 
tgcctggctt 
gctcacacca 
aactgcctgc 
cctctcaatt 
tatttataca 
tttaagcgat 
agacttgaca 
tgctaatgtc 
ctctatatta 
tgtagatagc 
attgggaaat 
agatatggca 
tttacttgaa 
tattattttt 
tttcctcttc 
tacaaggtaa 
ttcaccataa 
ccattaacct 
taaatccctt 
ataagtatgt 
caattgacag 
ttaaaaaata 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2080 



<210> 42 
<211> 253 
<212> PRT 

<213> Homo sapiens 



<400> 42 

Met Asp Asn Arg Phe Ala Thr Ala Phe Val 

15 10 
Leu lie Ser Thr lie Tyr Met Ala Ala Ser 

20 25 
Tyr Glu Tyr Arg Ser Pro Val Gin Glu Asn 

35 40 
Ser lie Trp Asp Glu Phe lie Ser Asp Glu 

50 55 
Asn Asp Ala Leu Phe Arg Tyr Asn Gly Thr 
65 70 

Cys lie Thr lie Pro Lys Asn Met His Trp 

85 90 
Thr Glu Ser Phe Asp Val Val Thr Lys Cys 

100 105 
Glu Gin Phe Met Glu Lys Phe Val Asp Pro 

115 120 
lie Asp Leu Leu Arg Thr Tyr Leu Trp Arg 

130 135 
Phe Val Ser Leu Gly Leu Met Cys Phe Gly 



He Ala Cys Val 
He Gly 
Ser Ser 



Ala Asp 
60 

Val Gly 
75 

Tyr Ser 

Val Ser 

Gly Asn 

Cys Gin 
140 
Ala Leu 



Thr Asp 

30 
Asp Leu 
45 

Glu Lys 



Leu Trp 
Pro Pro 



Phe Thr 
110 
His Asn 

125 

Phe Leu 



Leu Ser 
15 

Phe Trp 

Asn Lys 

Thr Tyr 

Arg Arg 

80 
Glu Arg 
95 

Leu Thr 



Ser Gly 
Leu Pro 
He Gly Leu Cys 
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150 










i. DO 










160 




Cys 


lie Cys 


Arg 


Ser 


Leu 


Tyr 


Pro 


1 nr 


lie 


Ala 


Thr 


Gly 


lie 


Leu 










165 










1 /u 










175 






Leu 


Leu 


Ala 


Gly Leu 


Cys 


i nr 


Leu 


oiy 


Ser 


vai 


Ser 


Cys 


Tyr 


Vai 








180 




















190 






Ala 


Gly 


He 


Glu 


Leu 


Leu 


His 


Gin 


Lys 


Leu 


Glu 


Leu 


Pro Asp Asn 


V dj. 






195 










200 










205 








Ser 


Gly Glu 


Phe 


Gly 


Trp 


Ser 


Phe 


Cys 


Leu 


Ala 


Cys 


Val 


Ser 


Ala 


Pro 




210 










215 










220 










Leu 


Gin 


Phe 


Met 


Ala 


Ser 


Ala 


Leu 


Phe 


He 


Trp 


Ala 


Ala 


His 


Thr 


Asn 


225 










230 










235 










240 


Arg 


Lys 


Glu 


Tyr 


Thr 


Leu 


Met 


Lys 


Ala 


Tyr 


Arg 


Val 


Ala 









245 250 



<210> 43 

<211> 2015 

<212> DNA 

<213> Homo sapiens 

<400> 43 

cttcacccgt ccgtgataag gagatttaag aagtctgagg gtggtgttaa gtttctcaga 60 
acagacgcat atttgcggat gcaattgcag aacaggaaac agaaccaggg agaattttag 120 
gtacccccaa atctcattgg ccctccgcac aagccaagcc acagccactc ctgccacaca 180 
atcggatcgc tttcagcact cgcagccgtg gacagctccc tcgccgcgcg gtcctttcct 24 0 
ctgcagtgag ctgatttgct ctgccagcag ctgtcggtgc cgcgctcgac accgagtcct 300 
agctagcgct cacagaatac gcgctccctc cctccccctt ctctgtcccc cgcctctcgc 360 
tcaccccggc ccactccagc ggcgactttg agggattccc tctctggcgg cctctgcagc 420 
agcacagccg gcctcattcg gggcactgcg agtatggatc tccaaggaag aggggtcccc 480 
agcatcgaca gacttcgagt tctcctgatg ttgttccata caatggctca aatcatggca 540 
gaacaagaag tggaaaatct ctcaggcctt tccactaacc ctgaaaaaga tatatttgtg 600 
gtgcgggaaa atgggacgac gtgtctcatg gcagagtttg cagccaaatt tattgtacct 660 
tatgatgtgt gggccagcaa ct.acgtagat ctgatcacag aacaggccga tatcgcattg 720 
acccggggag ctgaggtgaa gggccgctgt ggccacagcg agtcggagct gcaagtgttc 780 
tgggtggatc gcgcatatgc actcaaaatg ctctttgtaa aggaaagcca caacatgtcc 840 
aagggacctg aggcgacttg gaggctgagc aaagtgcagt ttgtctacga ctcctcggag 900 
aaaacccact tcaaagacgc agtcagtgct gggaagcaca cagccaactc gcaccacctc 960 
tctgccttgg tcacccccgc tgggaagtcc tatgagtgtc aagctcaaca aaccatttca 1020 
ctggcctcta gtgatccgca gaagacggtc accatgatcc tgtctgcggt ccacatccaa 1080 
ccttttgaca ttatctcaga ttttgtcttc agtgaagagc ataaatgccc agtggatgag 1140 
cgggagcaac tggaagaaac cttgcccctg attttggggc tcatcttggg cctcgtcatc 1200 
atggtaacac tcgcgattta ccacgtccac cacaaaatga ctgccaacca ggtgcagatc 1260 
cctcgggaca gatcccagta taagcacatg ggctagaggc cgttaggcag gcacccccta 1320 
ttcctgctcc cccaactgga tcaggtagaa caacaaaagc acttttccat cttgtacacg 1380 
agatacacca acatagctac aatcaaacag gcctgggtat ctgaggcttg cttggcttgt 1440 
gtccatgctt aaacccacgg aagggggaga ctctttcgga tttgtagggt gaaatggcaa 1500 
ttattctctc catgctgggg aggaggggag gagggtctca gacagctttc gtgctcatgg 1560 
tggcttggct ttgactctcc aaagagcaat aaatgccact tggagctgta tctggcccca 1620 
aagtttaggg attgaaaaca tgcttctttg aggaggaaac ccctttaggt tcagaagaat 1680 
atggggtgct ttgctccctt ggacacagct ggcttatcct atacagttgt caatgcacac 1740 
agaatacaac ctcatgctcc ctgcagcaag acccctgaaa gtgattcatg cttctggctg 1800 
gcattctgca tgtttagtga ttgtcttggg aatgtttcac tgctacccgc atccagcgac 1860 
tgcagcacca gaaaacgact aatgtaacta tgcagagttg tttggacttc ttcctgtgcc 1920 
aggtccaagt cgggggacct gaagaatcaa tctgtgtgag tctgtttttc aaaatgaaat 1980 
aaaacacact attctctggc aaaaaaaaaa aaaaa 2015 

<210> 44 
<211> 280 
<212> PRT 
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<213> Homo sapiens 



<400> 44 



Met 


Asp 


Leu 




Gly Arg 


Gly Val 


Pro 


Ser 


He Asp 


Arg 


Leu 


Arg 


Val 


JL 








5 










10 










15 




Leu 


Leu 


Met 


Leu 

c. u 


Phe 


His 


Thr 


Met 


Ala 


Gin 


He 


Met 


Ala 


Glu 
30 


Gin 


Glu 


vax 


Glu 


Asn 


Leu 


Ser Gly 


Leu 


Ser 


1 nr 


Asn 


Pro 


Glu 


Lys 


Asp 


He 


Phe 






35 










4 0 










45 






va± 


Val Arg 


Pill 

blU 


Asn Gly 


Thr 


Thr 


Cys 


Leu 


Met 


Ala 


Glu 


Phe 


Ala 


Ala 




50 










55 










60 










Lys 


Phe 


He 


val 


Pro 


Tyr 


Asp 


Val 


Trp 


Ala 


Ser 


Asn 


Tyr 


Val 


Asp 


Leu 


OD 










70 










75 










80 


T 1 £a 


Thr 


Glu 


bin 


Ala 


Asp 


He 


Ala 


Leu 


Thr Arg 


Gly 


Ala 


Glu 


Val 


Lys 










85 










90 










95 




bxy 


Arg Cys 


Gly 


His 


Ser 


Glu 


Ser 


Glu 


Leu 


Gin 


Val 


Phe 


Trp 


Val 


Asp 








100 










105 










110 




Arg 


Ala 


Tyr 
115 


Ala 


Leu 


Lys 


Met 


Leu 
120 


Phe 


Val- 


Lys 


Glu 


Ser 
125 


His 


Asn 


Met 


Ser 


Lys Gly 


Pro 


Glu 


Ala 


Thr 


Trp Arg 


Leu 


Ser 


Lys 


Val 


Gin 


Phe 


Val 




130 










135 










140 










Tyr 


Asp 


Ser 


Ser 


Glu 


Lys 


Thr 


His 


Phe 


Lys 


Asp 


Ala 


Val 


Ser 


Ala 


Gly 


14 0 










150 










155 










160 


Lys 


His 


Thr 


Ala 


Asn 
165 


Ser 


His 


His 


Leu 


Ser 
170 


Ala 


Leu 


Val 


Thr 


Pro 
175 


Ala 


bxy 


Lys 


Ser 


Tyr 

ion 
1 D U 


Glu 


Cys 


Gin 


Ala 


Gin 
185 


Gin 


Thr 


He 


Ser 


Leu 
190 


Ala 


Ser 


Ser 


Asp 


Pro 
195 


Gin 


Lys 


Thr 


Val 


Thr 
200 


Met 


He 


Leu 


Ser 


Ala 
205 


Val 


His 


He 


Gin 


Pro 


Phe 


Asp 


He 


He 


Ser 


Asp 


Phe 


Val 


Phe 


Ser 


Glu 


Glu 


His 


Lys 




210 










215 










220 








Cys 


Pro 


Val 


Asp 


Glu 


Arg 


Glu 


Gin 


Leu 


Glu 


Glu 


Thr 


Leu 


Pro 


Leu 


He 


225 










230 










235 










240 


Leu 


Gly Leu 


He 


Leu Gly 


Leu 


Val 


He 


Met 


Val 


Thr 


Leu 


Ala 


He 


Tyr 










245 










250 










255 


His 


Val 


His 


His 


Lys 


Met 


Thr 


Ala 


Asn 


Gin 


Val 


Gin 


He 


Pro 


Arg 


Asp 








260 










265 










270 




Arg 


Ser 


Gin 
275 


Tyr 


Lys 


His 


Met 


Gly 
280 



















<210> 45 
<211> 2937 
<212> DNA 

<213> Homo sapiens 



<400> 45 

ttagggagtc 

ctgcaacgga 

ttattgttaa 

tcaggggtcg 

agaaatactt 

ctgtgttata 

cagttttgcc 

agcgctattc 

actttgcacc 

agagcaacgt 

tgttgaccaa 

ttttcattaa 



gacccacgcg 
gagactcaag 
ccctataaac 
ggaccaaggc 
cagcacttgt 
tgaatgttgc 
cattgaccat 
tgacgcctca 
gagtaatgag 
gaatgttgaa 
ggacttaaaa 
ccattatcct 



tccgcggacg 
atgattccct 
gccaacaatc 
ccaaatgtct 
aagaactggt 
cctggttata 
gtttatggca 
aaactgaggg 
gcttgggaca 
ttactgaatg 
aatggcatga 
aatggggttg 



cgtgggcgga 
ttttacccat 
attatgacaa 
gtgcccttca 
ataaaaagtc 
tgagaatgga 
ctctgggcat 
aggagatcga 
acttggattc 
ctttacatag 
ttattccttc 
tcactgttaa 



cgcgtgggtt 
gttttctcta 
gatcttggct 
acagattttg 
catctgtgga 
aggaatgaaa 
cgtgggagcc 
gggaaaggga 
tgatatccgt 
tcacatgatt 
aatgtataac 
ttgtgctcga 



cggggactaa 60 
ctattgctgc 120 
catagtcgta 180 
ggcaccaaaa 240 
cagaaaacga 300 
ggctgcccag 360 
accacaacgc 420 
tccttcactt 480 
agaggtttgg 540 
aataagagaa 600 
aatttggggc 660 
atcatccatg 720 
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ggaaccagat tgcaacaaat 
gtacctcaat tcaagacttc 
ccatcacatc ggacatattg 
ccaccaatga ggcttttgag 
aagtggcttc cgaagctctt 
ctattatggg aggagcagtc 
acggtgacag tataacagta 
ataatggtgt gatccatttg 
ttgagctggc tggaaaacag 
catctgctct gaggccagat 
ctgatgatac tctcagcatg 
tgaaagtaaa agttggcctt 
gcaaacagct cagagtcttc 
agaaagggag taagcaaggg 
cagcagagaa atccctccat 
gcctacttga agctgcagac 
ttgtgccaac caatgatgct 
gggacaaaaa tgctcttcaa 
gaaaaggatt tgaacctggt 
ttctgaaaga agtaaatgat 
tcatgacaac aaatggtgta 
ctgttggaaa tgatcaactg 
agtttgttcg tggagaaaca 
tgaggaagtt gcaagccaac 
gttctcagtg aaaatccaaa 
tgaccttaga aaattgtgag 
agcaatcatc aaataattct 
gagggaaatt gtggagttag 
cacccttttt catcttgaca 
tccttgtcac cagattcatt 
aaagaccgag ccttgtatgt 
ccatgggaag ctaagttata 
gctttgcaca tttctatatg 
ttttgtactc tcagaatgtc 
tcaataaaac catttttcag 
ctcaagattt aagttaaaaa 
tgtaacaagt actcattaaa 



ggtgttgtcc atgtcattga 
attgaagcag aagatgacct 
gaggcccttg gaagagacgg 
aaacttccac gaggtgtcct 
atgaagtacc acatcttaaa 
tttgagacgc tggaaggaaa 
aatggaatca aaatggtgaa 
attgatcagg tcctaattcc 
caaaccacct tcacggatct 
ggagaataca ctttgctggc 
gatcagcgcc tccttaaatt 
aatgagcttt acaacgggca 
gtatatcgta cagctgtctg 
agaaacggtg cgattcacat 
gaaaagttaa aacaagataa 
ttgaaagagc tcctgacaca 
tttaagggaa tgactagtga 
aacatcattc tttatcacct 
gttactaaca ttttaaagac 
acacttctgg tgaatgaatt 
attcatgttg tagataaact 
ctggaaatac ttaataaatt 
gaagaaactc tgaagaaatt 
aaaaaagttc aaggatctag 
aaccagaaaa aaatgtttat 
agccaagttg acttcaggaa 
gaacacaaat ttaatatttt 
cctcctgtgg taaaggaatt 
ttaaaagttc tggctaactt 
acaattcaaa tcgaagagtt 
atgttatgga tacataaaat 
aaaataggtg cttggtgtac 
agtgggttta ctggtaaatt 
atatgcttct tgcaatgcat 
atataaagag aattacttca 
gtggtttgga cttgggaaca 
ggaaattgaa tgaaaaaaaa 



ccgtgtgctt acacaaattg 780 
ttcatctttt agagcagctg 840 
tcacttcaca ctctttgctc 900 
agaaaggatc atgggagaca 960 
tactctccag tgttctgagt 1020 
tacaattgag ataggatgtg 1080 
caaaaaggat attgtgacaa 1140 
tgattctgcc aaacaagtta 1200 
tgtggcccaa ttaggcttgg 1260 
acctgtgaat aatgcatttt 1320 
aattctgcag aatcacatat 1380 
aatactggaa accatcggag 1440 
cattgaaaat tcatgcatgg 1500 
attccgcgag atcatcaagc 1560 
gcgctttagc accttcctca 1620 
acctggagac tggacattat 1680 
agaaaaagaa attctgatac 1740 
gacaccagga gttttcattg 1800 
cacacaagga agcaaaatct 1860 
gaaatcaaaa gaatctgaca 1920 
cctctatcca gcagacacac 1980 
aatcaaatac atccaaatta 2040 
gttacaagaa gacacacccg 2100 
aagacgatta agggaaggtc 2160 
acaaccctaa gtcaataacc 2220 
ctgaaacatc agcacaaaga 2280 
tttttctgaa tgagaaacat 2340 
gaagaaaata taacacctta 2400 
tggaatccat tagagaaaaa 24 60 
gtgaactgtt atcccattga 2520 
gcacgcaagc cattatctct 2580 
aaaacttttt atatcaaaag 2640 
atgttatttt ttacaactaa 2700 
attttttaat ctcaaacgtt 2760 
aattgagtaa ttcagaaaaa 2820 
ggactttata cctcttttac 2880 
aaaaaaaggg cggccgc 2937 
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<210> 47 

<211> 3417 

<212> DNA 

<213> Homo sapiens 

<400> 47 

gccccgcgtc cgcgcctccg ggctccttcg gccccgccat gggctgctgc agctccgcct 60 
cctccgccgc gcagagctcc aaacgagaat ggaagccgct ggaggaccgt agctgcacag 120 

acataccatg gctgctgctc ttcatcctct tctgcattgg gatgggattt atttgtggct 180 
tttcaatagc aacaggtgca gcagcaagac tagtgtcagg atacgacagc tatggaaata 240 
tctgtgggca gaaaaataca aagttggaag caataccaaa cagtggcatg gaccacaccc 300 
agcggaagta tgtattcttt ttggatccat gcaacctgga cttgataaac cggaagatta 360 

agtctgtagc actgtgtgta gcagcgtgtc caaggcaaga actgaaaact ctgagtgatg 420 

ttcagaagtt tgcagagata aatggttcag ccctatgtag ctacaaccta aagccttctg 480 

aatacactac atctccaaaa tcttctgttc tctgccccaa actaccagtt ccagcgagtg 540 

cacctattcc attcttccat cgctgtgctc ctgtgaacat ttcctgctat gccaagtttg 600 
cagaggccct gatcaccttt gtcagtgaca atagtgtctt acacaggctg attagtggag 660 

taatgaccag caaagaaatt atattgggac tttgcttgtt atcactagtt ctatccatga 720 

ttttgatggt gataatcagg tatatatcaa gagtacttgt gtggatctta acgattctgg 780 

tcatactcgg ttcacttgga ggcacaggtg tactatggtg gctgtatgca aagcaaagaa 840 

ggtctcccaa agaaactgtt actcctgagc agcttcagat agctgaagac aatcttcggg 900 

ccctcctcat ttatgccatt tcagctacag tgttcacagt gatcttattc ctgataatgt 960 

tggttatgcg caaacgtgtt gctcttacca tcgccttgtt ccacgtagct ggcaaggtct 1020 

tcattcactt gccactgcta gtcttccaac ccttctggac tttctttgct cttgtcttgt 1080 

tttgggtgta ctggatcatg acacttcttt ttcttggcac taccggcagt cctgttcaga 1140 

atgagcaagg ctttgtggag ttcaaaattt ctgggcctct gcagtacatg tggtggtacc 1200 

atgtggtggg cctgatttgg atcagtgaat ttattctagc atgtcagcag atgacagtgg 1260 

caggagctgt ggtaacatac tattttacta gggataaaag gaatttgcca tttacaccta 1320 

ttttggcatc agtaaatcgc cttattcgtt accacctagg tacggtggca aaaggatctt 1380 

tcattatcac attagtcaaa attccgcgaa tgatccttat gtatattcac agtcagctca 1440 

aaggaaagga aaatgcttgt gcacgatgtg tgctgaaatc ttgcatttgt tgcctttggt 1500 

gtcttgaaaa gtgcctaaat tatttaaatc agaatgcata cacagccaca gctatcaaca 1560 

gcaccaactt ctgcacctca gcaaaggatg cctttgtcat tctggtggag aatgctttgc 1620 

gagtggctac catcaacaca gtaggagatt ttatgttatt ccttggcaag gtgctgatag 1680 

tctgcagcac aggtttagct gggattatgc tgctcaacta ccagcaggac tacacagtat 1740 

gggtgctgcc tctgatcatc gtctgcctct ttgctttcct agtcgctcat tgcttcctgt 1800 

ctatttatga aatggtagtg gatgtattat tcttgtgttt tgccattgat acaaaataca 1860 

atgatgggag ccctggcaga gaattctata tggataaagt gctgatggag tttgtggaaa 1920 

acagtaggaa agcaatgaaa gaagctggta agggaggcgt cgctgattcc agagagctaa 1980 

agccgatggc ttcgggagca agttctgctt gaacctagcc gacggttatg gaaacccatt 2040 

gacattccaa aacaatatat acacataact atgtatttgt gtgtgtgggt gtgtgtatat 2100 

atgtatatgt atgtgtgtat atatgtatat gtatatacac acacacacat aaatcagcca 2160 

aaatcagaga aaaggaacag ggatttaata ccttttttat. gcttattttt gtcaaacatg 2220 
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tactcctttc atacgggtgg cttttacaag gcaacttccg tcatttaatg ttttcaactg 2280 

taattgtctt aatggaaatg ttaaaattca tatctgatta acatttttaa taacttagag 2340 

gagattttaa ctttatttaa aaataggtaa aattattgta cctaattatg tctaaagttt 2400 

attcaggggt aatttccctg atgtctgtat aaaatcaaga tcttatttta ctgatgcata 24 60 

agtcctagtg ggtcaagact aggcatatgc tttcagataa ataaggaatt actccaatca 2520 

gttttcccca atcaaagaag ccatgtcatt ttacttttag aaacatacaa ttgggcccaa 2580 

tatgggaatt ttcataatag ttcatacatt tgtcagccaa cattaaaagg taaccaactc 2640 

ctcaggtatt tgtagtttac cctaacgctt ctttaaaaga aagtaggtaa aaaaagaaaa 2700 

gggtagataa tctttcgtat gcaaactttt cccttatatt ttgtctttct ttcctttttg 2760 

actttagtag catcctccac acatttgtgt gcctgatttg aaaggaagct ggggcaccca 2820 

gcgagtttag cctttaagtt tctgtgtatt gatttgcaga ttaagtaatg ctgagaggaa 2880 

taaagaaggg acagaaacat ggaacataaa gcattgaaaa ttccggtgct tgggcttcgg 2940 

cttcagagta acgtcagtgg cttagggtta aacggccatt ttattcaaat gcttgctata 3000 

caatctgaaa acacactggc aggtgctcct ctccttggca attcattgag tatccagagt 3060 

tctacgatgt ttaactgaag aattggctaa tgttttgatc ctccagtgtg actgttgttt 3120 

ttgtttgggg gtgggtttgg ggttttttgc ttttttattc ctgaagctta ccagatatga 3180 

atggctaata ctccattgtt ctgcttgttg taatggtgaa tgctttaaga aaaaaaagtg 3240 

taatttgcta agaataattc atgatctgtt tatgcgataa ctcctttttg ttacaatttt 3300 

tttaaaaaaa gctatttttg ttaatgtaaa gtaaatattt cagagcaaat tttttaaact 3360 

tattgcacta aatacaggct ctgtacaaaa aaaaaaaaaa agggcggccg ctagact 3417 
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<213> Homo sapiens 
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Arg 


Tyr 


U -i c 

n is 


Leu 






4 Jo 










440 


±xe 


T 1 

lie 


Thr 


Leu 


Vai 


Lys 


He 


Pro 














y| C C 

4 oo 




Ser 


(jin 


Leu 


Lys 


Gly 


Lys 


blU 


Asn 


H DO 










yi T n 
4 /U 






Ser 


Cys 


lie 


Cys 


Cys 


Leu 


Trp 


Cys 










485 








Asn 


Cain 


Asn 


Ala 


Tyr 


Tnr 


Ala 


Thr 








500 












Ser 


Aia 


Lys 


Asp 


ill a 


rne 


vai 






olo 










520 


vai 


Ala 


Tnr 


He 


Asn 


Thr 


Vai 


Gly 




c "3 n 










c c 
OOO 




vai 


Leu 


lie 


Vai 


Cys 


Ser 


Thr 


Gly 


^ /I c; 
0 4 0 










RCA 

ooU 






Tyr 


Gin 


bin 


Asp 


Tyr 


inr 


vai 


Trp 










565 








Leu 


Phe 


Ala 


Phe 


Leu 


Vai 


Aia 


His 








580 










Vai 


Vai 




Vai 


Leu 




Leu 


L»y s 






595 










600 


Asp Gly 


Ser 


Pro Gly 


Arg 


Glu 


Phe 




610 










615 




Phe 


Vai 


Glu 


Asn 


Ser 


Arg 


Lys 


Ala 


625 










630 






Vai 


Ala 


Asp 


Ser Arg 


Glu 


Leu 


Lys 



645 



Ala 



Lys 


bin 


Arg 


Arg 


Ser 


Pro 


Lys 


Glu 


Z 00 










270 






lie 


TV 1 

Ala 


blU 


Asp 


Asn 


Leu 


Arg 


Ala 










o o c 

z8o 








inr 


vax 


rne 


inr 


vai 


He 


Leu 


Phe 








JUU 










Arg 


vai 


Ala 


Leu 


Thr 


He 


Ala 


Leu 






'3 1 C 

Jlo 










320 


lie 


nlS 


Leu 


Pro 


Leu 


Leu 


Vai 


Phe 




J Ju 










335 




Leu 


vai 


Leu 


rne 


Trp 


Vai 


Tyr 


Trp 


J^ 0 










OCA 

JoO 






1 Oi 


i Ili 


bxy 


Ser 


Pro 


vai 


Gin 


Asn 










o <r c 

36o 








lie 


Ser 


Gly 


Pro 


Leu 


Gin 


Tyr 


Met 








"3 O 

380 










lie 


Trp 


lie 


ber 


blU 


Phe 


He 


Leu 






"3 Q C 

jyo 










4 00 


biy 


Ala 


Vai 


Vai 


Thr 


Tyr 


Tyr 


Phe 




410 










415 




rne 


inr 


Pro 


lie 


Leu 


Ala 


Ser 


Vai 


/I o c 
4zo 










430 






biy 


inr 
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Ala 


Lys 


Gly 


Ser 


Phe 










445 








Arg 


Met 


He 


Leu 


Met 


Tyr 


He 


His 
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Ala 


Cys 


Ala 


Arg 


Cys 


Vai 


Leu 


Lys 






4 /o 










480 


Leu 


blU 


Lys 


Cys 


Leu 


Asn 


Tyr 


Leu 
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Ala 


He 


Asn 


Ser 


Thr 


Asn 


Phe 


Cys 
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lie 


Leu 


Vai 


Glu 


Asn 


Ala 


Leu 


Arg 
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525 








Asp 


Pne 


Met 


Leu 


Phe 


Leu 


Gly 


Lys 
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Leu 


Ala 


biy 


T 1 ^ 

lie 


Met 


Leu 


Leu 


Asn 






c c c 
OOO 
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vai 


Leu 


Pro 
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He 


He 


Vai 


Cys 




C T A 

0 /U 










575 




Cys 


rne 


Leu 


Ser 


lie 


Tyr 


blU 


Met 


c o c 

OOO 










590 






Fne 


Ala 


He 


Asp 


Thr 


Lys 


Tyr 


Asn 










605 








Tyr 


Met 


Asp 


Lys 


Vai 


Leu 


Met 


Glu 








620 










Met 


Lys 


Glu 


Ala 


Gly 


Lys 


Gly 


Gly 






635 










640 


Pro 


Met 


Ala 


Ser 


Gly 


Ala 


Ser 


Ser 




650 










655 
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<400> 49 

cctcgcgtcc gcgcacaccg gggtggcagc 
cgcctctgcc cgcaaccgct gagccatcca 
cggcctgggc ggtgctgctg ctgctgctgc. 
cgccgggtcg gggccgtgcc gcggggccgc 
tagatgactg ccatgccgac gccctgtgtc 
gcaagcctgg ctaccaaggg gaaggcaggc 
agctcaatgg aggctgtgtc catgactgtt 
gttttgatgg cttcatgttg gctcatgacg 
tggagaacaa tggcggctgc cagcatacct 
gctgcaagga ggggtttttc ctgagtgaca 
agggcctgag ctgcatgaat aaggatcacg 
ggggcagcgt cgcctgtgag tgcaggcctg 
gcatcttgac ctgtaaccat gggaacggtg 
atggcccaga gtgcagctgc catccacagt 
ttgagcgaga ggacactgtc ctggaggtga 
gggataaacg ggtgaaacgg cggctgctca 
gtgaccgcac ctgtaaggat acttcgacag 
ctctccagtt ggatgggaag acatgtaaag 
gttgtgatca tttctgcaaa aacatcgtgg 
ttaaattatt aacagatgag aagtcttgcc 
cctgtgacca cagctgcatc aaccaccctg 
acaccctgta tggcttcacc cactgtggag 
gctgtcagca ggtctgtgtg aacacagtgg 
acaagctcca ctggaataaa aaagactgtg 
tgtcaccccg tgtgtccctg cactgcggta 
gatgtcactc tggcattcac ctctcttcag 
ttaagctaaa tgaaggcaag tgtagtttga 
gaccagcact accagagaag cacagctcag 
catgcagctc tggcaagcaa gtcccaggag 
tgtttatcac tgttgagttt gagcttgaaa 
acctgagctg catcgtaaag cgaaccgaga 
gaaaggccgt ccacagggag cagtttcacc 
ctaaaaagcc tcccagaaca tctgaacgcc 
atgcagaaaa ccaatgtgtc agttgcaggg 
gctgcatttt atgtccaaat ggaacc.ttcc 
catgcccaag accaggaaat tctggggccc 
aatgtggagg tctgtgtcaa cctggtgaat 
tctgtgccct gggcacgttc cagcctgaag 
gaggccttgc caccaaacat cagggagcta 
aatgttcacc tggacatttc tacaacacca 
gaacatacca gcctgaattt ggaaaaaata 
ctgactttga tggctccaca aacataaccc 
tgggagattt cactgggtac attgaatccc 
ccgagtgtac gtggaccatc aacccacccc 
agatcttcct gcccatagag gacgactgtg 
catccaattc tgtgacaaca tatgaaacct 
cctccaggtc aaagaagctg tggattcagt 
ggttccaggt cccatacgtg acatatgatg 
ttcgagatgg caggctctat gcatctgaga 
ttatcaaggc tctgtttgat gtcctggccc 
aggagtcccg agagatgttt ccaagatcgt 
ggtttttgag accttacaaa tgactcagcc 
tagggttggt gggacagagc tgtcttcctt 
ctcccgtatc agtgactcat tagagttcaa 
attgaacttg gtttttcttt cccagcatcg 
catcagcttc tcactgctgt gggcggatgt 
actttggtca gcctaggtga gactcacctg 
gtctgtagtg gaaaggaggc cacagaataa 



gccgcagcgg gcagggcgcc cgcactccgc 60 
tgggggtcgc gggccgcaac cgtcccgggg 120 
cgccactgct gctgctggcg ggggccgtcc 180 
aggaggatgt agatgagtgt ccgcaagggc 24 0 
agaacacacc cacctcctac aagtgctcct 300 
agtgtgagga catcgatgaa tgtggaaatg 360 
tgaatattcc aggcaattat cgtigca^ctt 420 
gtcataattg tcttgatgtg gacgagtgcc 480 
gtgtcaacgt catggggagc tatgagtgct 540 
atcagcacac ctgcattcac cgctcggaag 600 
gctgtagtca catctgcaag gaggccccaa 660 
gttttgagct ggccaagaac cagagagact 720 
ggtgccagca ctcctgtgac gatacagccg 780 
acaagatgca cacagatggg aggagctgcc 840 
cagagagcaa caccacatca gtggtggatg 900 
tggaaacgtg tgctgtcaac aatggaggct 960 
gtgtccactg cagttgtcct gttggattca 1020 
atattgatga gtgccagacc cgcaatggag 1080 
gcagttttga ctgcggctgc aagaaaggat 1140 
aagatgtgga tgagtgctct ttggatagga 1200 
gcacatttgc ttgtgcttgc aaccgagggt 1260 
acaccaatga gtgcagcatc aacaacggag 1320 
gcagctatga atgccagtgc caccctgggt 1380 
tggaagtgaa ggggctcctg cccacaagtg 1440 
agagtggtgg aggagacggg tgcttcctca 1500 
atgtcaccac catcaggaca agtgtaacct 1560 
aaaatgctga gctgtttccc gagggtctgc 1620 
taaaagagag cttccgctac gtaaacctta 1680 
cccctggccg accaagcacc cctaaggaaa 1740 
ctaaccaaaa ggaggtgaca gcttcttgtg 1800 
agcggctccg taaagccatc cgcacgctca 1860 
tccagctctc aggcatgaac ctcgacgtgg 1920 
aggcagagtc ctgtggagtg ggccagggtc 1980 
ctgggaccta ttatgatgga gcacgagaac 2040 
aaaatgagga aggacaaatg acttgtgaac 2100 
tgaagacccc agaagcttgg aatatgtctg 2160 
attctgcaga tggctttgca ccttgccagc 2220 
ctggtcgaac ttcctgcttc ccctgtggag 2280 
cttcctttca ggactgtgaa accagagttc 2340 
ccactcaccg atgtattcgt tgcccagtgg 2400 
attgtgtttc ttgcccagga aatactacga 24 60 
agtgtaaaaa cagaagatgt ggaggggagc 2520 
caaactaccc aggcaattac ccagccaaca 2580 
ccaagcgccg catcctgatc gtggtccctg 2640 
gggactatct ggtgatgcgg aaaacctctt 2700 
gccagaccta cgaacgcccc atcgccttca 2760 
tcaagtccaa tgaagggaac agcgctagag 2820 
aggactacca ggaactcatt gaagacatag 2880 
accatcagga aatacttaag gataagaaac 2940 
atccccagaa ctatttcaag tacacagccc 3000 
tcatccgatt gctacgttcc aaagtgtcca 3060 
cacgtgccac tcaatacaaa tgttctgcta 3120 
ctgcatgtca gcacagtcgg gtattgctgc 3180 
tttttataga taatacagat attttggtaa 3240 
tggatgtaga ctgagaatgg ctttgagtgg 3300 
cttggataga tcacgggctg gctgagctgg 3360 
tccttctggg gtcttactcc tcctcaagga 3420 
gctgcttatt ctgaaacttc agcttcctct 3480 
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agcccggccc tctctaaggg agccctctgc actcgtgtgc aggctctgac caggcagaac 3540 
aggcaagagg ggagggaagg agacccctgc aggctccctc cacccacctt gagacctggg 3600 
aggactcagt ttctccacag ccttctccag cctgtgtgat acaagtttga tcccaggaac 3660 
ttgagttcta agcagtgctc gtgaaaaaaa aaagcagaaa gaattagaaa taaataaaaa 3720 
ctaagcactt ctggagacac ctataggagt cgtattac 3758 

<210> 50 
<211> 997 
<212> PRT 

<213> Homo sapiens 
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Val 


Ala 


Gly 


Arg 
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Pro Gly Ala 
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Trp Ala Val Leu 
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5 








10 






15 


JLieU Jj6U 


Leu 


Leu 


Pro 


Pro 


Leu Leu 


Leu 


Leu Ala 


Gly Ala 


Val Pro Pro 






20 








25 








30 


Gly Arg 


Gly Arg Ala 


Ala 


Gly Pro 


Gin 


Glu Asp 


Val 


Asp 


Glu Cys Pro 




35 








4 0 








45 




bin bi.y 


Leu Asp Asp 


Cys 


His Ala 


Asp Ala Leu 


Cys 


Gin 


Asn Thr Pro 












55 






60 






'P Wi V O A >• 


Tyr 


Lys 


Cys 


Ser 


Cys Lys 


Pro Gly Tyr 


Gin 


Gly Glu Gly Arg 


00 








70 






75 






80 


bin bys 


Glu 


Asp 


He 


Asp 


Glu Cys 


Gly Asn Glu 


Leu 


Asn 


Gly Gly Cys 








85 








90 






95 


vai nis 


Asp Cys 


Leu 


Asn 


He Pro 


Gly Asn Tyr 


Arg 


Cys 


Thr Cys Phe 






100 








105 








110 


ASp vjiy 


Phe 


Met 


Leu 


Ala 


His Asp 


Gly 


His Asn 


Cys 


Leu 


Asp Val Asp 




115 








120 








125 


Giu Cys 


Leu 


Glu 


Asn 


Asn 


Gly Gly 


Cys 


Gin His 


Thr 


Cys 


Val Asn Val 


1 JU 










135 






140 




neu oiy 


Ser 


Tyr 


Glu 


Cys 


Cys Cys 


Lys 


Glu Gly 


Phe 


Phe 


Leu Ser Asp 


1 4 D 














155 






160 


& c r\ 1 n 


His 


Thr 


Cys 


He 


His Arg 


Ser 


Glu Glu 


Gly Leu 


Ser Cys Met 








165 








170 






175 


&Qn T.\/Q 


Asp 


His 


Gly 


Cys 


ber nis 


He 


Cys Lys 


Glu 


Ala 


Pro Arg Gly 






180 








185 








190 


kjsz^ V d J. 


Ala 


Cys 


Glu 


Cys 


Arg Pro 


Gly 


Phe Glu 


Leu 


Ala 


Lys Asn Gin 
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ODD 
^ U U 








205 




Cys 


He 


Leu 


inr 


Cys Asn 


His 


Gly Asn Gly Gly Cys Gin His 


^ J. U 
















220 






Ser Cys 


Asp Asp 


Thr 


Aia 


t\sp biy 


Pro 


Glu Cys 


Ser 


Cys 


His Pro Gin 


225 








230 






235 




240 


Tyr Lys 


Met 


His 


Thr 


Asp 


Gly Arg 


Ser 


Cys Leu 


Glu 


Arg Glu Asp Thr 








245 








250 






255 


Val Leu 


Glu 


Val 


Thr 


Glu 


Ser Asn 


Thr 


Thr Ser 


Val 


Val 


Asp Gly Asp 






260 








265 








270 


Lys Arg 


Val 


Lys 


Arg 


Arg 


Leu Leu 


Met 


Glu Thr 


Cys 


Ala 


Val Asn Asn 




275 








280 








285 




Gly Gly 


Cys 


Asp Arg 


Thr 


Cys Lys 


Asp Thr Ser 


Thr 


Gly Val His Cys 
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300 






Ser Cys 


Pro 


Val 


Gly 


Phe 


Thr Leu 


Gin 


Leu Asp Gly Lys 


Thr Cys Lys 
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315 
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Asp He 


Asp 


Glu 


Cys 


Gin 


Thr Arg 


Asn Gly Gly Cys Asp His Phe Cys 
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330 






335 


Lys Asn 


He 


Val 


Gly 


Ser 


Phe Asp 


Cys 


Gly Cys 


Lys 


Lys 


Gly Phe Lys 
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345 








350 


Leu Leu 


Thr 


Asp 


Glu 


Lys 


Ser Cys 


Gin 


Asp Val 


Asp 


Glu 


Cys Ser Leu 




355 
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Asp Arg 


Thr 


Cys 


Asp 


His 


Ser Cys 


He 


Asn His 


Pro Gly Thr Phe Ala 
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370 
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Gly 
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Gly 


Phe 


Thr 


His 


Cys 


Gly 
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Gly 
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Val 


Cys 
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1 nr 
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val 


Gly Ser 
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Glu 
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Cys 


His 
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Gly 
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Lys 
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nlS 


Trp 
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Lys 
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Cys 


val 


Glu 


Val 


Lys 


Gly 


Leu 


Leu 


Pro 






/IOC 
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Pro Arg 


Val 
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Leu 
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Cys 
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Lys 


Ser 
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Asp 
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Cys 
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Leu 


Arg 


Cys 
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Ser 


Gly 


He 
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Leu 


Ser 


Ser 
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Asn 
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Val 
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Phe 


Arg 
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Val 
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1 nr 
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Ser 
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Gly 


Lys 
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Val 
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Gly 
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Pro 


Gly 
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Pro 


Ser 


Thr 


Pro 


Lys 


Glu 


Met 


Phe 


He 


Thr 


Val 


Glu 


Phe 


Glu 


Leu 


Glu 












550 










555 










560 


Thr 


Asn 


Gin 


Lys 


Glu 


Val 


Thr 


Ala 


Ser 


Cys 


Asp 


Leu 


Ser 


Cys 


He 


Val 










565 










570 








575 




Lys 


Arg 


Tnr 


Glu 


Lys 


Arg 


Leu 


Arg 


Lys 


Ala 


He 


Arg 


Thr 


Leu 


Arg 


Lys 








580 
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His 


Arg 
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Met 
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Val 
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Pro 


Arg 
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Ser 


Glu 
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Glu 


Ser 




610 
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620 










Cys 


Gly 


Val 
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Gly 
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Val 
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Cys 


Arg 
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630 
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Ala 


Gly 
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Gly 
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Arg 


Glu 
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Cys 


He 


Leu 


Cys 
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Pne 
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Asn 
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Glu 


Gly 


Gin 


Met 
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Cys 
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Cys 
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CjIU 


Cys 


Gly Gly 


Leu 


Cys 
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Gly 
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Ala 


Pro 
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Cys 
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Thr 
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Cys 
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Pro 


Pro 


Lys 


Arg 
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He 
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He 


Val 



835 840 845 
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<400> 51 

ccgcggcgct 

cgcggctgcc 

aaccaccgcg 

gccctctgga 

aggccgcgga 

tgccctctct 

tgagaaaatt 

gcaatatcat 

agggttcaga 

ccatgaccac 

ccatgagcat 

aaataagcga 

aaacagccag 

ggacagtgtt 

tcactttcta 

cagctccact 

aacaaatgaa 

aaatcctcag 

ggttccgctg 

tgctagatct 

ttcattacaa 

tctgctgggg 

tttccttgtg 

acattctcat 

aagaggacca 

ttccacgtgg 

tgtcctcaca 

aaatgatgat 

aaatgaggag 

agagccctcc 

agctcatgct 



gcgcgcggcg 
gggtggctgc 
tgttcgcgcc 
ccgcccgtgt 
gacgaaggcg 
gtcacaaatc 
agtccgaatt 
ctacaacagc 
aaattacttc 
gaccatcact 
cactcagacc 
aaagctcttt 
gggaaaggag 
agtgctagtg 
gagacaatag 
ccacccagtg 
tctgtgagtg 
gagtgtttca 
aatgcaacag 
tgtctgattc 
atagcctggg 
gttatcttag 
gcactggccg 
gcaagtcacc 
cttttcagtc 
aagggtctaa 
ttgatcaaac 
gatgtggaga 
aaagtagata 
cactttgatt 
catccacagg 



gtaattagtg 
gcggcgctgc 
tggtagagat 
ggaaccaaac 
caatggcgag 
cccttcatga 
gggaatctgg 
ttttctaccg 
aaaatatagg 
cagaccacga 
acgagcatca 
gcccagacca 
ctcaccgacc 
aagtgacctc 
agactccaag 
tcacatcaaa 
agccccgaaa 
atgcatcaaa 
agttcaacta 
atacaagtga 
ttggtggttt 
tgcctctcat 
ttgggacttt 
accatagtca 
atctgtcttc 
cagctctagg 
aatttaaaga 
ttaagaagca 
cagatgatcg 
ctcagcagcc 
aagtctacaa 



attgtcttcc 
ccccggaccg 
ttctcgaaga 
ctgcgcgcgt 
gaagttatct 
actaaaagca 
cattaatgtt 
ctatggagaa 
catagataag 
gcatcactca 
ctctgaccat 
tgactcagat 
agaacatgcc 
aactgtgtac 
acctggaaaa 
gagccgggtg 
aggctttatg 
gctactgaca 
tctctgtcca 
aaagaaggct 
tatagccatt 
gaatcgggtg 
gagtggtgat 
tagccatgaa 
tcaaaacata 
aggcctgtat 
taagaagaaa 
gttgtccaag 
aactgaaggc 
tgcagtcttg 
tgaatatgta 



agcttcgcga 
.aggggcagcc 
caccagtggg 
ggccgggccg 
gtaatcttga 
gctgctttcc 
gacttggcaa 
aataattctt 
attaaaagaa 
gaccatgagc 
gatcatcact 
agttcaggta 
agtggtagaa 
aacactgtct 
ctcttcccca 
agccggctgg 
tattccagaa 
tctcatggca 
gccatcatca 
gaaatccctc 
tccatcatca 
tttttcaaat 
gcttttttac 
gaaccagcaa 
gaagaaagtg 
ttcatgtttc 
aagaatcaga 
tatgaatctc 
tatttacgag 
gaagaagaag 
cccagagggt 



aggctagggg 
aatccaatga 
cccgttccga 
tgggacaacg 
tcctgacctt 
cccagaccac 
tttccacacg 
tgtcagttga 
tccatataca 
gtcactcaga 
cttctggtaa 
aagatcctag 
ggaatgtcaa 
ctgaaggaac 
aagatgtaag 
ctggtaggaa 
acacaaatga 
tgggcatcca 
accaaattga 
caaagaccta 
gtttcctgtc 
ttctcctgag 
accttcttcc 
tggaaatgaa 
cctattttga 
ttgttgaaca 
agaaacctga 
aactttcaac 
cagactcaca 
aggtcatgat 
gcaagaataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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atgccattca catttccacg atacactcgg ccagtcagac gatctcattc accaccatca 1920 

tgactaccat catattctcc atcatcacca ccaccaaaac caccatcctc acagtcacag 1980 

ccagcgctac tctcgggagg agctgaaaga tgccggcgtc gccactttgg cctggatggt 204 0 

gataatgggt gatggcctgc acaatttcag cgatggccta gcaattggtg ctgcttttac 2100 

tgaaggctta. tcaagtggtt taagtacttc tgttgctgtg ttctgtcatg agttgcctca 2160 

tgaattaggt gactttgctg ttctactaaa ggctggcatg accgttaagc aggctgtcct 2220 

ttataatgca ttgtcagcca tgctggcgta tcttggaatg gcaacaggaa ttttcattgg 2280 

tcattatgct gaaaatgttt ctatgtggat atttgcactt actgctggct tattcatgta 2340 

tgttgctctg gttgatatgg tacctgaaat gctgcacaat gatgctagtg accatggatg 24 00 

tagccgctgg gggtatttct ttttacagaa tgctgggatg cttttgggtt ttggaattat 24 60 

gttacttatt ccatatttga acataaaatc gtgttcgtat aaatttctag ttaaggttta 2520 

aatgctagag tagcttaaaa agttgtcata gtttcagtag gtcataggga gatgagtttg 2580 

tatgctgtac tatgcagcgt ttaaagttag tgggttttgt gatttttgta ttgaatattg 2640 

ctgtctgtta caaagtcagt taaaggtacg ttttaatatt taagttattc tatcttggag 2700 

ataaaatctg tatgtgcaat tcaccggtat taccagttta ttatgtaaac aagagatttg 2760 

gcatgacatg ttctgtatgt ttcagggaaa aatgtcttta atgctttttc aagaactaac 2820 

acagttattc ctatactgga ttttaggtct ctgaagaact gctggtgttt aggaataaga 2880 

atgtgcatga agcctaaaat accaagaaag cttatactga atttaagcaa agaaataaag 2940 

gagaaaagag aagaatctga gaattgggga ggcatagatt cttataaaaa tcacaaaatt 3000 

tgttgtaaat tagaggggag aaatttagaa ttaagtataa aaaggcagaa ttagtataga 3060 

gtacattcat taaacatttt tgtcaggatt atttcccgta aaaacgtagt gagcactctc 3120 

atatactaat tagtgtacat ttaactttgt ataatacaga aatctaaata tatttaatga 3180 

attcaagcaa tatacacttg accaagaaat tggaatttca aaatgttcgt gcgggttata 3240 

taccagatga gtacagtgag tagtttatgt atcaccagac tgggttattg ccaagttata 3300 

tatcaccaaa agctgtatga ctggatgttc tggttacctg gtttacaaaa ttatcagagt '3360 

agtaaaactt tgatatatat gaggatatta aaactacact aagtatcatt tgattcgatt 3420 

cagaaagtac tttgatatct ctcagtgctt cagtgctatc attgtgagca attgtcttta 3480 

tatacggtac tgtagccata ctaggcctgt ctgtggcatt ctctagatgt ttctttttta 354 0 

cacaataaat tccttatatc agcttgaaaa aaaaaaaaaa aaaaaa 3586 

<21C> 52 
<211> 752 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_feature 
<222> 9026, 9030 
<223> n = A,T,C or G 

<400> 53 

atgcccaagc gcgcgcactg gggggccctc tccgtggtgc tgatcctgct ttggggccat 60 

ccgcgagtgg cgctggcctg cccgcatcct tgtgcctgct acgtccccag cgaggtccac 120 

tgcacgttcc gatccctggc ttccgtgccc gctggcattg ctagacacgt ggaaagaatc 180 

aatttggggt ttaatagcat acaggccctg tcagaaacct catttgcagg actgaccaag 240 

ttggagctac ttatgattca cggcaatgag atcccaagca tccccgatgg agctttaaga 300 

gacctcagct ctcttcaggt tttcaagttc agctacaaca agctgagagt gatcacagga 360 

cagaccctcc agggtctctc taacttaatg aggctgcaca ttgaccacaa caagatcgag 420 

tttatccacc ctcaagcttt caacggctta acgtctctga ggctactcca tttggaagga 480 

aatctcctcc accagctgca ccccagcacc ttctccacgt tcacattttt ggattatttc 540 

agactctcca ccataaggca cctctactta gcagagaaca tggttagaac tcttcctgcc 600 

agcatgcttc ggaacatgcc gcttctggag aatctttact tgcagggaaa tccgtggacc 660 

tgcgattgtg agatgagatg gtttttggaa tgggatgcaa aatccagagg aattctgaag 720 

tgtaaaaagg acaaagctta tgaaggcggt cagttgtgtg caatgtgctt cagtccaaag 780 

aagttgtaca aacatgagat acacaagctg aaggacatga cttgtctgaa gccttcaata 840 

gagtcccctc tgagacagaa caggagcagg agtattgagg aggagcaaga acaggaagag 900 

gatggtggca gccagctcat cctggagaaa ttccaactgc cccagtggag catctctttg 960 

aatatgaccg acgagcacgg gaacatggtg aacttggtct gtgacatcaa gaaaccaatg 1020 

gatgtgtaca agattcactt gaaccaaacg gatcctccag atattgacat aaatgcaaca 1080 

gttgccttgg actttgagtg tccaatgacc cgagaaaact atgaaaagct atggaaattg 1140 

atagcatact acagtgaagt tcccgtgaag ctacacagag agctcatgct cagcaaagac 1200 

cccagagtca gctaccagta caggcaggat gctgatgagg aagctcttta ctacacaggt 1260 

gtgagagccc agattcttgc agaaccagaa tgggtcatgc agccatccat agatatccag 1320 

ctgaaccgac gtcagagtac ggccaagaag gtgctacttt cctactacac ccagtattct 1380 

caaacaatat ccaccaaaga tacaaggcag gctcggggca gaagctgggt aatgattgag 1440 

cctagtggag ctgtgcaaag agatcagact gtcctggaag ggggtccatg ccagttgagc 1500 

tgcaacgtga aagcttctga gagtccatct atcttctggg tgcttccaga tggctccatc 1560 

ctgaaagcgc ccatggatga cccagacagc aagttctcca ttctcagcag tggctggctg 1620 

aggatcaagt ccatggagcc atctgactca ggcttgtacc agtgcattgc tcaagtgagg 1680 

gatgaaatgg accgcatggt atatagggta cttgtgcagt ctccctccac tcagccagcc 1740 

gagaaagaca cagtgacaat tggcaagaac ccaggggagt cggtgacatt gccttgcaat 1800 

gctttagcaa tacccgaagc ccaccttagc tggattcttc caaacagaag gataattaat 1860 

gatttggcta acacatcaca tgtatacatg ttgccaaatg gaactctttc catcccaaag 1920 

gtccaagtca gtgatagtgg ttactacaga tgtgtggctg tcaaccagca aggggcagac 1980 

cattttacgg tgggaatcac agtgaccaag aaagggtctg gcttgccatc caaaagaggc 2040 
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agacgcccag 
ggctcgggca 
gaggtgttcc 
agaagaaagc 
ggtcgcagag 
gagcgctggg 
gtacccccgt 
tttcctgctg 
tcctcagcag 
gccagcatgg 
agcacacctc 
actgaaggag 
tctcctactc 
acagagggtt 
gagcctccat 
gatttggaga 
cttactccaa 
tctactatag 
atccaccttg 
aaagagatgt 
agaagttctg 
ggtataatga 
ctagacaaag 
accatgagca 
aaattccgcc 
tctactcaac 
gttcctacag 
aatgcagaac 
catcgatata 
ccagaaaata 
actgtttctc 
aaaatatatt 
tcagatggaa 
attttagtca 
actatgggag 
aatccctcaa 
ggggaaaatc 
gagtttttgt 
acaactctct 
gatcaagatc 
cacaccccta 
atgtctttgg 
gcatctagag 
gcaaccccag 
ccctcttccg 
tttggtagta 
gcttctcatc 
ctacctgaaa 
cactggacca 
cagtttacaa 
cccagcattc 
gtgtttggaa 
agaattcctc 
ccacagttgg 
gagagaaaag 
gactttggcc 
actaacttac 
tcttctgtcc 
cctcctgcat 



gtgcaaaggc 
tgggagatga 
tcaaaacaaa 
tgaaactctg 
tgtttgaatc 
ctgatatttt 
tgattaaaac 
tttctccccc 
atgtacctct 
ggctagaaca 
tggaggaagt 
acctgaaggg 
tgcacacatt 
ggtctgcagc 
tggatgctgt 
ctaagtcaca 
cccccaccat 
gggaaccagg 
tgaaaagtag 
ctcagacact 
dgagtgaggg 
gcagtatgtc 
acaccacaac 
ctcacccttc 
accggcacaa 
caactcaagc 
cttgggtgga 
ccacatccaa 
ccccttctac 
aacatagaaa 
tgaaaactga 
catcttaccc 
aagaaattaa 
ctggtgaatc 
aatttaagga 
ggacggccca 
ttacagaccc 
cctctttgac 
caagcataaa 
atcttgaaac 
ctgctgcccg 
gacaaaccac 
attccaagga 
tcaacaatga 
accgggatgc 
ggagtctacc 
aactaaccag 
tgtccacaca 
acaaaccgga 
ctccaagatt 
ctagtaagtt 
ataacaacat 
attattccaa 
gagtcacccg 
ttattccagg 
ctccggcacc 
agaatatccc 
agtcctcagg 
ccaaattctg 



tctttccaga 
agagaacact 
ggatgatgcc 
gaagcattcg 
tagacgaagg 
agccaaagtc 
cacaagtcct 
ctcagcatct 
acttggtgaa 
caaccacaat 
tgttgatgac 
gacagcagcc 
agacacagtc 
agatgttgga 
ctccttggct 
accagatgag 
ctgggttaat 
tgtcccaggc 
tctaagcact 
acagggagga 
ccaagagagc 
tccagttaag 
agtaacaaca 
tcgaaggaga 
gcaaacccca 
acctgacatt 
taacacagtt 
gggaacacca 
agtgagctca 
cattgttact 
gggcccttat 
taaagtccaa 
ggatgatgtt 
aattactaat 
agaatcctct 
gcctgggagg 
tccccttctt 
agtctccaca 
agtggaggtg 
cactgtggct 
gatgaaggag 
caccactaag 
aaatgttttc 
aggaacacag 
atttaacttg 
acgtggccca 
agtccctgcc 
aagcgcttcc 
aataactaca 
atcaagtaca 
tactgaccga 
ccctgaggca 
tggaagactc 
gagaccccag 
ttcctacaac 
tccgttgttg 
tatggtctct 
aagcttccac 
gtctcttggg 



gtcagagaag 
tcaaggagac 
atcaatggag 
gaaaaagaac 
ataaacatgg 
cgtgggaaaa 
ccatccttga 
cctgtgcaga 
gaagagcacg 
ggagttattc 
ctttctgaga 
cctacactta 
tatgaaaagc 
tcgtcaccag 
gagtctgagc 
gataagatga 
gactccagta 
caatcacatc 
caagacacct 
aatatgctag 
aaatccatca 
aagcctgcgg 
acaccaaggc 
cccaacggga 
cccacaactt 
aagatttcaa 
aataccccca 
cggagaaaac 
agagcgtccg 
cccagttcag 
gattccttag 
gagacacttc 
gccacaaatg 
gccataccaa 
cctgtaggct 
ctacagacag 
aaagagcttg 
ccatttcacc 
gcttcaagtc 
attctccttt 
ccagcatcct 
ccagcacttc 
ttgaattatg 
catatgtcag 
tctacaaagc 
gatagccaac 
aaacccatcc 
agatactttg 
tatccttctg 
acaattcctc 
agaactgacc 
agaaacccag 
cctttcttta 
atacccactt 
aggatacatt 
cacactccgc 
tccacccaga 
cagagcagct 
gaaaagcccc 



acatcgtgga 
ttctgcatcc 
acaagaaagc 
cagagaccaa 
caaacaaaca 
atctccctaa 
gcctagaagt 
cagtaaccag 
ttttgggtac 
ttgttgaacc 
agactgagga 
tatctgagcc 
ccacccatga 
agcccacatc 
ccatgcaata 
aagaagacac 
catcacagtt 
tacaaggact 
tactgattaa 
agggagaccc 
ctttgcctga 
aaaccacagt 
aaaaagttgc 
gaaggagatt 
ttgccccatc 
gtcaagtgga 
aacagttgga 
acgggaagag 
gatccaagcc 
aaactatact 
attacatgac 
cagtcacata 
ttgacaaaca 
cttctcgctc 
ttccaggaac 
acatacctgt 
aggatgtgga 
aggaagaagc 
aggcagaaac 
ctgaaactag 
cgtccccatc 
ccagtccaag 
tggggaatcc 
ggccaaatga 
tggaattgga 
gccaggatgg 
taccaacagc 
taacttccca 
gggctttgcc 
tcccattgca 
aattcaatgg 
ttggaaagcc 
ccaacaagac 
ctcctgcccc 
cccatagcac 
agaccacggg 
gttctatctc 
caaagttctt 
aaatcctcac 



ggatgaaggg 
aaaggaccaa 
caagaaaggg 
tgttgcagaa 
gattaatccg 
gggcacagaa 
cacaccacct 
tgctgaagaa 
catttcctca 
tgaagtaaca 
gataacttcc 
ttatgaacca 
agagacggca 
cagtgagtat 
ctttgaccca 
ctttgcacac 
atttgaggat 
gacagacaac 
aaagggtatg 
cacacactcc 
ctccacactg 
tggtaccctc 
tccgtcatcc 
acgccccaac 
agagactttt 
gagttctctg 
aatggagaag 
gccaaacaaa 
cagcccttct 
tttgcctaga 
aaccaccaga 
taaacccaca 
taaaagtgac 
cttggtctcc 
tccaacctgg 
taccacttct 
tttcacttcc 
tggttcttcc 
caccaccctt 
accacagaat 
cacaattctc 
aatatctcaa 
agaaacagaa 
attatcaaca 
aaagcaagta 
aagagttcat 
aacagtgagg 
gtcacctcgt 
agagaacaaa 
catgtccaaa 
ttactccaaa 
tcccagtcca 
tctttctttt 
agtaatgaga 
cttccatctg 
atcaccctca 
ctttataaca 
tgcaggagga 
caagtcccca 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4 920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
54 60 
5520 
5580 
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cagactgtgt ccgtcaccgc tgagacagac 
ccaaagcctt tcgttacttg gacaaaggtt 
aggatacaac ggtttgaggt tctcaagaac 
caagatcgag gccagtatat gtgcaccgcc 
gtcttgcttt cggtcaccgt gcagcaacct 
actgtctacc tgggagacac cattgcaatg 
caaatttcct ggatcttccc tgacaggagg 
cgcatcaccc tgcacgaaaa ccggaccctt 
ggcgtctata agtgcgtggc cagcaatgca 
cacgtggcgg cactgccccc cgttatccac 
ccggggctca gcattcacat tcactgcact 
tgggtgctcg gggacggtac ccagatccgc 
gttttcccca acgggacgct ctacatccgc 
gagtgcgtgg ccgccaacct ggtaggctcc 
cgtgcagcag ccaacgcgcg catcacgggc 
ggaggaaccc tcaagctgga ctgcagcgcc 
aggctgccgt ccaagaggat gatcgacgcg 
tttgccaatg ggaccctggt ggtgaaatca 
tgcgtagctc gaaataaggt tggtgatgac 
aaaccggcca agattgaaca caaggaggag 
ctgaaagtgg actgtgtggc caccgggctt 
gacgggagtc tggtgaactc cttcatgcag 
tatgtcgtct tcaacaatgg gacactctac 
gactacacct gctttgctga aaatcaggtc 
gtggtgacag cgcccgccac catccggaac 
.ggagacgtgg tcactgtagc ctgtgaggcc 
ttgtccccaa ccaacaaggt gatccccacc 
ggcactctcc ttattcagaa agcccagcgt 
aggaacagcg cgggagagga taggaagacg 
aagatcaacg gtaaccccaa ccccatcacc 
cggaaactga ttgactgcaa agctgaaggc 
cccgagggtg tggttctgcc agctccatac 
ggttccctgg acatcaggag tttgaggaag 
cgcaacgagg gaggggaggc gaggttgatc 
aaacccatct tccacgaccc gatcagcgag 
agcctcaact gctctgccgc ggggaccccg 
ggcaccgatc tgcagagtgg acagcagctg 
ctacacatta gcggtctctc ctcggtggac 
gccgctggcc acacggagag gctggtctcc 
aagcagtatc ataacctggt cagcatcatc 
cctcccgggg ctgggcaggg acgtttctcc 
ggcccccaaa ccctgggacg cgtttctctt 
gcctcggtgt ttgacagggg tacctatgta 
gtcaccagca tccccgtgat tgtgatcgcc 
ccggtcatct acacccggcc cgggaacacc 
cccaaagctg acatcacgtg ggagttaccg 
gctcgtctgt atggaaacag atttcttcac 
acacagagag atgccggctt ctacaagtgc 
aaaacaactt acatccacgt cttctgaaat 
acaacaaagc ggggtttgta agggaagcca 
gtcacagtgc atggtggcct ctggtgggtt 
gttgggaaaa ggaagcaatg cagacacgag 
cttttgtgtt tacatcatgc caggggcttc 
tttctttttt tgcaaatgcc actcgactgc 
acattcatca aaaataagcc atagacatga 
tcacctagtt aacctgctgc agtttttaca 
tctttcagtt atttcctctg tcacttcaaa 
cagagtgact gatatatata tatatntttn 
tttatatgaa aaaagaaaaa catttcttcc 



actgtgttcc cctgtgaggc aacaggaaaa 5640 
tccacaggag ctcttatgac tccgaatacc 5700 
ggtaccttag tgatacggaa ggttcaagta 5760 
agcaacctgc acggcctgga caggatggtg 5820 
caaatcctag cctcccacta ccaggacgtc 5880 
gagtgtctgg ccaaagggac cccagccccc 5940 
gtgtggcaaa ctgtgtcccc cgtggagagc 6000 
tccatcaagg aggcgtcctt ctcagacaga 6060 
gccggggcgg acagcctggc catccgcctg 6120 
caggagaagc tggagaacat ctcgctgccc 6180 
gccaaggctg cgcccctgcc cagcgtgcgc 6240 
ccctcgcagt tcctccacgg gaacttgttt 6300 
aacctcgcgc ccaaggacag cgggcgctat 6360 
gcgcgcagga cggtgcagct gaacgtgcag 6420 
acctccccgc ggaggacgga cgtcaggtac 6480 
tcgggggacc cctggccgcg catcctctgg 6540 
ctcttcagtt ttgatagcag aatcaaggtg 6600 
gtgacggaca aagatgccgg agattacctg 6660 
tacgtggtgc tcaaagtgga tgtggtgatg 6720 
aacgaccaca aagtcttcta cgggggtgac 6780 
cccaatcccg agatctcctg gagcctccca 6840 
tcggatgaca gcggtggacg caccaagcgc 6900 
tttaacgaag tggggatgag ggaggaagga 6960 
gggaaggacg agatgagagt cagagtcaag 7020 
aagacttact tggcggttca ggtgccctat 7080 
aaaggagaac ccatgcccaa ggtgacttgg 7140 
tcctctgaga agtatcagat ataccaagat 7200 
tctgacagcg gcaactacac ctgcttggtc 7260 
gtgtggattc acgtcaacgt ccagccaccc 7320 
accgtgcggg agatagcagc cgggggcagt 7380 
atccccaccc cgagggtgtt atgggctttt 7440 
tatggaaacc ggatcactgt ccatggcaac 7500 
agcgactccg tccagctggt atgcatggca 7560 
gtgcagctca ctgtcctgga gcccatggag 7620 
aagatcacgg ccatggcggg ccacaccatc 7680 
acacccagcc tggtgtgggt ccttcccaat 7740 
cagcgcttct accacaaggc tgacggcatg 7800 
gcyggggcct accgctgcgt ggcccgcaat 7860 
ctgaaggtgg gactgaagcc agaagcaaac 7920 
aatggtgaga ccctgaagct cccctgcacc 7980 
tggacgctcc ccaatggcat gcatctggag 8040 
ctggacaatg gcaccctcac ggttcgtgag 8100 
tgcaggatgg agacggagta cggcccttcg 8160 
tatcctcccc ggatcaccag cgagcccacc 8220 
gtgaaactga actgcatggc tatggggatt 8280 
gataagtcgc atctgaaggc aggggttcag 8340 
ccccagggat cactgaccat ccagcatgcc 8400 
atggcaaaaa acattctcgg cagtgactcc 84 60 
gtggattcca gaatgattgc ttaggaactg 8520 
ggttggggaa taggagctct taaataatgt 8580 
tcaagttgag gttgatcttg atctacaatt 8640 
aaggagggct cagccttgct gagacacttt 8700 
attcagggtg tctgtgctct gactgcaatt 8760 
cttcataagc gtccatagga tatctgagga 8820 
acaacacctc actaccccat tgaagacgca 8880 
tgatagactt tgttccagat tgacaagtca 8940 
actccagctt gcccaataag gatttagaac 9000 
aattcagagt tacatacata cagctaccat 9060 
tggaactcac tttttatata atgttttata 9120 
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tatatatttt tkcctttcaa atcagacgat gagactagaa ggagaaatac tttctgtctt 9180 

attaaaatta ataaattatt ggtctttaca agacttggat acattacagc agacatggaa 9240 

aatataattt taaaaaattt ctctccaacc tccttcaaat tcagtcacca ctgttatatt 9300 

accttctcca ggaaccctcc agtggggaag gctgcgatat tagatttcct tgtatgcaaa 9360 

gtttttgttg aaagctgtgc tcagaggagg tgagaggaga ggaaggagaa aactgcatca 9420 

taactttaca gaattgaatc tagagtcttc cccgaaaagc ccagaaactt ctctgcagta 94 80 

tctggcttgt ccatctggtc taaggtggct gcttcttccc cagccatgag tcagtttgtg 9540 

cccatgaata atacacgacc tgttatttcc atgactgctt tactgtattt ttaaggtcaa 9600 

tatactgtac atttgataat aaaataatat tctcccaaaa aaaaaa 9646 

<210> 54 
<211> 2828 
<212> PRT 

<213> Homo sapiens 
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Gly Ser Gly Met 

Pro Lys Asp Gin 
720 

Gly Asp Lys Lys 
735 

His Ser Glu Lys 

750 

Phe Glu Ser Arg 
765 

Glu Arg Trp Ala 

Lys Gly Thr Glu 
800 

Leu Ser Leu Glu 
815 
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Val 


Tnr 


Pro 


Pro 

o o n 


Phe 


Pro 


Ala 


Val 


Ser 
825 


Pro 


Pro 


Ser 


Ala 


Ser 
830 


Pro 


Val 




1 nr 


vaj. 

OOO 


Tnr 


Ser 


Ala 


Glu 


blU 
Q A r\ 


ber 


Ser 


Ala 


Asp 


Val 
845 


Pro 


Leu 


Leu 


bxy 




u J.U 


Pill 

biU 


HIS 


vai 


Leu 


P 1 *r 

biy 


i nr 


He 


Ser 


Ser 


Ala 


Ser 


Met 


Gly 




P 0 










Q ^ 










r> /- rs 

860 








Leu 


(jIU 


nlS 


Asn 


His 


Asn 


Gly 


Val 


He 


Leu 


Val 


Glu 


Pro 


Glu 


Val 


Thr 


o c c 
ODD 










o "7 r» 
870 










875 










880 


Ser 


1 fir 


Pro 


Leu 


Glu 


Glu 


Val 


vai 


TV r-, 

Asp 


Asp Leu 


Ser 


Glu 


Lys 


Thr 


Glu 










885 










890 








895 




blU 


i±e 


inr 


ber 


Thr 


blU 


biy 


ASp 


T <-M 1 

LGU 


Lys Gly 


Thr 


Ala 


Ala 


Pro 


Thr 








900 










yOb 










910 






Leu 


He 


Ser 


Glu 


Pro 


Tyr 


Glu 


Pro 


Ser 


Pro 


Thr 


Leu 


His 


Thr 


Leu 


Asp 






yio 










y^u 










925 






Thr 


Val 


Tyr 


Glu 


Lys 


Pro 


Thr 


His 


Glu 


Glu 


Thr 


Ala 


Thr 


Glu 


Gly 


Trp 




Q "3 n 










y Jo 










94 0 






Ser 


Aia 


Ala 


Asp 


Val 


Gly 


Ser 


Ser 


Pro 


Glu 


Pro 


Thr 


Ser 


Ser 


Glu 


Tyr 












OCA 

950 










955 










960 


GiU 


Pro 


Pro 


Leu 


Asp 
965 


Ala 


Val 


Ser 


Leu 


Ala 
970 


Glu 


Ser 


Glu 


Pro 


Met 
975 


Gin 


Tyr 


Phe 


Asp 


Pro 

no r\ 

980 


Asp 


Leu 


Glu 


Thr 


Lys 
985 


Ser 


Gin 


Pro 


Asp 


Glu' 
990 


Asp 


Lys 


Met 


Lys 


Glu 


Asp 


Thr 


Phe 


Ala 


His 


Leu 


Thr 


Pro 


Thr 


Pro 


Thr 


He 


Trp 






995 










1000 








1005 




Val 


Asn 


Asp 


Ser 


Ser 


Thr 


Ser 


Gin 


Leu 


Phe 


Glu 


Asp 


Ser 


Thr 


He 


Gly 




1010 








1015 








1020 








Glu 


Pro 


Gly 


Val 


Pro 


Gly 


Gin 


Ser 


His 


Leu 


Gin 


Gly 


Leu 


Thr 


Asp 


Asn 


1025 








1030 








1035 








1040 


lie 


His 


Leu 


Val 


Lys 


Ser 


Ser 


Leu 


Ser 


Thr 


Gin 


Asp 


Thr 


Leu 


Leu 


He 










1045 








1050 








1055 


Lys 


Lys 


Gly 


Met 


Lys 


Glu 


Met 


Ser 


Gin 


Thr 


Leu 


Gin 


Gly Gly 


Asn 


Met 








1060 








1065 








1070 




Leu 


Glu 


Gly 


Asp 


Pro 


Thr 


His 


Ser 


Arg 


Ser 


Ser 


Glu 


Ser 


Glu 


Gly Gin 






1075. 








1080 








1085 






Glu 


Ser 


Lys 


Ser 


He 


Thr 


Leu 


Pro 


Asp 


Ser Thr Leu Gly 


He 


Met 


Ser 




1090 








1095 








1100 








Ser 


Met 


Ser 


Pro 


Val 


Lys 


Lys 


Pro 


Ala 


Glu 


Thr 


Thr 


Val 


Gly 


Thr 


Leu 


1105 








1110 








1115 








1120 


Leu 


Asp 


Lys 


Asp 


Thr 


Thr 


Thr 


Val 


Thr 


Thr 


Thr 


Pro Arg 


Gin 


Lys 


Val 










1125 








1130 








1135 


Ala 


Pro 


Ser 


Ser 


Thr 


Met 


Ser 


Thr 


His 


Pro 


Ser Arg Arg Arg 


Pro 


Asn 








1140 








1145 








1150 




Gly Arg Arg Arg 


Leu Arg 


Pro 


Asn 


Lys 


Phe Arg His Arg 


His 


Lys 


Gin 






1155 








1160 








1165 






Thr 


Pro 


Pro 


Thr 


Thr 


Phe 


Ala 


Pro 


Ser 


Glu 


Thr 


Phe 


Ser 


Thr 


Gin 


Pro 




1170 








1175 








1180 








Thr 


Gin 


Ala 


Pro 


Asp 


He 


Lys 


He 


Ser 


Ser 


Gin 


Val 


Glu 


Ser 


Ser 


Leu 


1185 








1190 








1195 








1200 


Val 


Pro 


Thr 


Ala 


Trp 


Val 


Asp Asn 


Thr 


Val 


Asn 


Thr 


Pro 


Lys 


Gin 


Leu 










1205 








1210 








1215 


Glu 


Met 


Glu 


Lys 


Asn 


Ala 


Glu 


Pro 


Thr 


Ser 


Lys 


Gly Thr 


Pro Arg Arg 








1220 








1225 








1230 




Lys 


His 


Gly Lys Arg 


Pro 


Asn 


Lys 


His 


Arg Tyr 


Thr 


Pro 


Ser 


Thr 


Val 






1235 








1240 








1245 






Ser 


Ser Arg Ala 


Ser Gly 


Ser 


Lys 


Pro 


Ser 


Pro 


Ser 


Pro 


Glu 


Asn 


Lys 



1250 1255 1260 



His Arg Asn He Val Thr Pro Ser Ser Glu Thr He Leu Leu Pro Arg 
1265 1270 1275 1280 

Thr Val Ser Leu Lys Thr Glu Gly Pro Tyr Asp Ser Leu Asp Tyr Met 
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1285 1290 1295 

Thr Thr Thr Arg Lys lie Tyr Ser Ser Tyr Pro Lys Val Gin Glu Thr 

1300 1305 1310 

Leu Pro Val Thr Tyr Lys Pro Thr Ser Asp Gly Lys Glu lie Lys Asp 

1315 1320 1325 

Asp Val Ala Thr Asn Val Asp Lys His Lys Ser Asp lie Leu Val Thr 

1330 1335 1340 

Gly Glu Ser lie Thr Asn Ala lie Pro Thr Ser Arg Ser Leu Val Ser 
1345 1350 1355 1360 

Thr Met Gly Glu Phe Lys Glu Glu Ser Ser Pro Val Gly Phe Pro Gly 

1365 1370 1375 

Thr Pro Thr Trp Asn Pro Ser Arg Thr Ala Gin Pro Gly Arg Leu Gin 

1380 1385 1390 

Thr Asp lie Pro Val Thr Thr Ser Gly Glu Asn Leu Thr Asp Pro Pro 

1395 1400 1405 

Leu Leu Lys Glu Leu Glu Asp Val Asp Phe Thr Ser Glu Phe Leu Ser 

1410 1415 1420 

Ser Leu Thr Val Ser Thr Pro Phe His Gin Glu Glu Ala Gly Ser Ser 
1425 1430 1435 1440 

Thr Thr Leu Ser Ser He Lys Val Glu Val Ala Ser Ser Gin Ala Glu 

1445 1450 1455 

Thr Thr Thr Leu Asp Gin Asp His Leu Glu Thr Thr Val Ala He Leu 

1460 1465 1470 

Leu Ser Glu Thr Arg Pro Gin Asn His Thr Pro Thr Ala Ala Arg Met 

1475 1480 1485 

Lys Glu Pro Ala Ser Ser Ser Pro Ser Thr He Leu Met Ser Leu Gly 

1490 1495 1500 

Gin Thr Thr Thr Thr Lys Pro Ala Leu Pro Ser Pro Arg He Ser Gin 
1505 1510 1515 1520 

Ala Ser Arg Asp Ser Lys Glu Asn Val Phe Leu Asn Tyr Val Gly Asn 

1525 1530 1535 

Pro Glu Thr Glu Ala Thr Pro Val Asn Asn Glu Gly Thr Gin His Met 

1540 1545 1550 

Ser Gly Pro Asn Glu Leu Ser Thr Pro Ser 'Ser Asp Arg Asp Ala Phe 

1555 1560 1565 

Asn Leu Ser Thr Lys Leu Glu Leu Glu Lys Gin Val Phe Gly Ser Arg 

1570 1575 1580 

Ser Leu Pro Arg Gly Pro Asp Ser Gin Arg Gin Asp Gly Arg Val His 
1585 1590 1595 1600 

Ala Ser His Gin Leu Thr Arg Val Pro Ala Lys Pro He Leu Pro Thr 

1605 1610 1615 

Ala Thr Val Arg Leu Pro Glu Met Ser Thr Gin Ser Ala Ser Arg Tyr 

1620 1625 1630 

Phe Val Thr Ser Gin Ser Pro Arg His Trp Thr Asn Lys Pro Glu He 

1635 1640 1645 

Thr Thr Tyr Pro Ser Gly Ala Leu Pro Glu Asn Lys Gin Phe Thr Thr 

1650 1655 1660 

Pro Arg Leu Ser Ser Thr Thr He Pro Leu Pro Leu His Met Ser Lys 
1665 1670 1675 168.0 

Pro Ser He Pro Ser Lys Phe Thr Asp Arg Arg Thr Asp Gin Phe Asn 

1685 1690 1695 

Gly Tyr Ser Lys Val Phe Gly Asn Asn Asn He Pro Glu Ala Arg Asn 

1700 1705 1710 

Pro Val Gly Lys Pro Pro Ser Pro Arg He Pro His Tyr Ser Asn Gly 

1715 1720 1725 

Arg Leu Pro Phe Phe Thr Asn Lys Thr Leu Ser Phe Pro Gin Leu Gly 

1730 1735 1740 

Val Thr Arg Arg Pro Gin He Pro Thr Ser Pro Ala Pro Val Met Arg 
1745 1750 1755 1760 
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Glu Arg Lys Val lie Pro Gly Ser Tyr Asn Arg He His Ser His Ser 

1765 1770 1775 

Thr Phe His Leu Asp Phe Gly Pro Pro Ala Pro Pro Leu Leu His Thr 

1780 1785 1790 

Pro Gin Thr Thr Gly Ser Pro Ser Thr Asn Leu Gin Asn He Pro Met 

1795 1800 1805 

Val Ser Ser Thr Gin Ser Ser lie Ser Phe He Thr Ser Ser Val Gin 

1810 1815 1820 

Ser Ser Gly Ser Phe His Gin Ser Ser Ser Lys Phe Phe Ala Gly Gly 
1825 1830 1835 1840 

Pro Pro Ala Ser Lys Phe Trp Ser Leu Gly Glu Lys Pro Gin He Leu 

1845 1850 18.55 

Thr Lys Ser Pro Gin Thr Val Ser Val Thr Ala Glu Thr Asp Thr Val 

1860 1865 1870 

Phe Pro Cys Glu Ala Thr Gly Lys Pro Lys Pro Phe Val Thr Trp Thr 

1875 1880 1885 

Lys Val Ser Thr Gly Ala Leu Met Thr Pro Asn Thr Arg He Gin Arg 

1890 1895 1900 

Phe Glu Val Leu Lys Asn Gly Thr Leu Val He Arg Lys Val Gin Val 
1905 1910 1915 1920 

Gin Asp Arg Gly Gin Tyr Met Cys Thr Ala Ser Asn Leu His Gly Leu 

1925 1930 1935 

Asp Arg Met Val Val Leu Leu. Ser Val Thr Val Gin Gin Pro Gin He 

1940 1945 1950 

Leu Ala Ser His Tyr Gin Asp Val Thr Val Tyr Leu Gly Asp Thr He 

1955 1960 1965 

Ala Met Giu Cys Leu Ala Lys Gly Thr Pro Ala Pro Gin He Ser Trp 

1970 1975 1980 

He Phe Pro Asp Arg Arg Val Trp Gin Thr Val Ser Pro Val Glu Ser 
1985 1990 1995 2000 

Arg He Thr Leu His Glu Asn Arg Thr Leu Ser He Lys Glu Ala Ser 

2005 2010 2015 

Phe Ser Asp Arg Gly Val Tyr Lys Cys Val Ala Ser Asn Ala Ala Gly 

2020 2025 2030 

Ala Asp Ser Leu Ala He Arg Leu His Val Ala Ala Leu Pro Pro Val 

2035 2040 2045 

He His Gin Glu Lys Leu Glu Asn He Ser Leu Pro Pro Gly Leu Ser 

2050 2055 2060 

He His He His Cys Thr Ala Lys Ala Ala Pro Leu Pro Ser Val Arg 
2065 2070 2075 2080 

Trp Val Leu Gly Asp Gly Thr Gin He Arg Pro Ser Gin Phe Leu His 

2085 * 2090 2095 

Gly Asn Leu Phe Val Phe Pro Asn Gly Thr Leu Tyr He Arg Asn Leu 

2100 2105 2110 

Ala Pro Lys Asp Ser Gly Arg Tyr Glu Cys Val Ala Ala Asn Leu Val 

2115 2120 2125 

Gly Ser Ala Arg Arg Thr Val Gin Leu Asn Val Gin Arg Ala Ala Ala 

2130 2135 2140 

Asn Ala Arg He Thr Gly Thr Ser Pro Arg Arg Thr Asp Val Arg Tyr 
2145 2150 2155 2160 

Gly Gly Thr Leu Lys Leu Asp Cys Ser Ala Ser Gly Asp Pro Trp Pro 

2165 2170 2175 

Arg He Leu Trp Arg Leu Pro Ser Lys Arg Met He Asp Ala Leu Phe 

2180 2185 2190 

Ser Phe Asp Ser Arg He Lys Val Phe Ala Asn Gly Thr Leu Val Val 

2195 2200 2205 

Lys Ser Val Thr Asp Lys Asp Ala Gly Asp Tyr Leu Cys Val Ala Arg 

2210 2215 2220 

Asn Lys Val Gly Asp Asp Tyr Val Val Leu Lys Val Asp Val Val Met 
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2225 2230 2235 2240 

Lys Pro Ala Lys lie Glu His Lys Glu Glu Asn Asp His Lys Val Phe 

2245 2250 2255 

Tyr Gly Gly Asp Leu Lys Val Asp Cys Val Ala Thr Gly Leu Pro Asn 

2260 2265 2270 

Pro Glu lie Ser Trp Ser Leu Pro Asp Gly Ser Leu Val Asn Ser Phe 

2275 2280 2285 

Met Gin Ser Asp Asp Ser Gly Gly Arg Thr Lys Arg Tyr Val Val Phe 

2290 2295 2300 

Asn Asn Gly Thr Leu Tyr Phe Asn Glu Val Gly Met Arg Glu Glu Gly 
2305 2310 2315 2320 

Asp Tyr Thr Cys Phe Ala Glu Asn Gin Val Gly Lys Asp Glu Met Arg 

2325 2330 2335 

Val Arg Val Lys Val Val Thr Ala Pro Ala Thr He Arg Asn Lys Thr 

2340 2345 2350 

Tyr Leu Ala Val Gin Val Pro Tyr Gly Asp Val Val Thr Val Ala Cys 

2355 2360 2365 

Glu Ala Lys Gly Glu Pro Met Pro Lys Val Thr Trp Leu Ser Pro Thr 

2370 ' 2375 2380 

Asn Lys Val He Pro Thr Ser Ser Glu Lys Tyr Gin He Tyr Gin Asp 
2385 2390 2395 2400 

Gly Thr Leu Leu He Gin Lys Ala Gin Arg Ser Asp Ser Gly Asn Tyr 

2405 2410 2415 

Thr Cys Leu Val Arg Asn Ser Ala Gly Glu Asp Arg Lys Thr Val Trp 

2420 2425 2430 

He His Val Asn Val Gin Pro Pro Lys He Asn Gly Asn Pro Asn Pro 

2435 2440 2445 

He Thr Thr Val Arg Glu He Ala Ala Gly Gly Ser Arg Lys Leu He 

2450 2455 2460 

Asp Cys Lys Ala Glu Gly He Pro Thr Pro Arg Val Leu Trp Ala Phe 
2465 2470 2475 2480 

Pro Glu Gly Val Val Leu Pro Ala Pro Tyr Tyr Gly Asn Arg He Thr 

2485 2490 2495 

Val His Gly Asn Gly Ser Leu Asp He Arg Ser Leu Arg Lys Ser Asp 

2500 2505 2510 

Ser Val Gin Leu Val Cys Met Ala Arg Asn Glu Gly Gly Glu Ala Arg 

2515 2520 2525 

Leu He Val Gin Leu Thr Val Leu Glu Pro Met Glu Lys Pro He Phe 

2530 2535 . 2540 

His Asp Pro He Ser. Glu Lys He Thr Ala Met Ala Gly His Thr He 
2545 2550 2555 2560 

Ser Leu Asn Cys Ser Ala Ala Gly Thr Pro Thr Pro Ser Leu Val Trp 

2565 2570 2575 

Val Leu Pro Asn Gly Thr Asp Leu Gin Ser Gly Gin Gin Leu Gin Arg 

2580 2585 2590 

Phe Tyr His Lys Ala Asp Gly Met Leu His He Ser Gly Leu Ser Ser 

2595 2600 2605 

Val Asp Ala Gly Ala Tyr Arg Cys Val Ala Arg Asn Ala Ala Gly His 

2610 2615 2620 

Thr Glu Arg Leu Val Ser Leu Lys Val Gly Leu Lys Pro Glu Ala Asn 
2625 2630 2635 -2640 

Lys Gin Tyr His Asn Leu Val Ser He He Asn Gly Glu Thr Leu Lys 

2645 2650 2655 

Leu Pro Cys Thr Pro Pro Gly Ala Gly Gin Gly Arg Phe Ser Trp Thr 

2660 2665 2670 

Leu Pro Asn Gly Met His Leu Glu Gly Pro Gin Thr Leu Gly Arg Val 

2675 2680 2685 

Ser Leu Leu Asp Asn Gly Thr Leu Thr Val Arg Glu Ala Ser Val Phe 
2690 2695 2700 
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Asp Arg Gly Thr Tyr Val Cys Arg Met Glu Thr Glu Tyr Gly Pro Ser 
2705 2710 2715 2720 

Val Thr Ser He Pro Val He Val He Ala Tyr Pro Pro Arg He Thr 

2725 2730 2735 

Ser Glu Pro Thr Pro Val He Tyr Thr Arg Pro Gly Asn Thr Val Lys . 

2740 2745 2750 

Leu Asn Cys Met Ala Met Gly He Pro Lys Ala Asp He Thr Trp Glu 

2755 2760 2765 

Leu Pro Asp Lys Ser His Leu Lys Ala Gly Val Gin Ala Arg Leu Tyr 

2770 2775 2780 

Gly Asn Arg Phe Leu His Pro Gin Gly Ser Leu Thr He Gin His Ala 
2785 2790 2795 2800 

Thr Gin Arg Asp Ala Gly Phe Tyr Lys Cys Met Ala Lys Asn He Leu 

2805 2810 2815 

Gly Ser Asp Ser Lys Thr Thr Tyr He His Val Phe 
2820 2825 



<210> 55 . 

<211> 6763 

<212> DNA 

<213> Homo sapiens 



<400> 55 

atggtgctga 

aagcggggca 

agtgtgattt 

cacatggcta 

acggagagag 

tgctgtctgg 

atgggctgtg 

gaggctggtg 

aggaccatct 

aaaccagaca 

ggaaaaaagg 

ctgtccgttg 

ggttttctcc 

gatgaacgga 

cccacactcc 

ggagcctgga 

gttcttgggc 

ggaaaaaggg 

ttcctgagaa 

cgtgtatgtg 

ggctgcccta 

gaattggcca 

attccagaca 

tggagtacgt 

ttgaacgtct 

ccagagactg 

tttggagcaa 

cagcctgggg 

cactccaaag 

agtaaggcgg 

tcctcacctt 

caagggagaa 

ggggcgcccc 

acagaaatca 

gacacccaag 



cgccctctga 
catgcacatc 
gggaggcacc 
ctggtgacac 
ctagggctac 
ttttgagatg 
attgggtgcc 
ctggggatgg 
ttgtgaagtt 
aattgcggct 
aagtgcccaa 
cgtggaaggc 
tgggctacgg 
cacacgaaat 
taggcgctct 
taagcagctc 
tcgcccttca 
gaaggggagg 
atggaagtca 
gtctccctgc 
acaaagcgaa 
ggtgggatta 
ctgtgtatga 
cagtcttcca 
ggccagtcaa 
aggggaaagt 
agtccctcac 
aacgctatct 
ccttcattgt 
atgttcagca 
ctcccagagc 
atgccaagga 
gaaaacccca 
ctggggagga 
accagaaacg 



agagcagtcg 
ttgccagggc 
aagaggacct 
caactctgca 
cgatccgaca 
ttcccttagc 
ttcaaggatg 
tggggcagct 
ggtggtcagc 
cttcaaggct 
caagcccttg 
accacgcctg 
ggagagtggc 
taaaaagcta 
gagagctcct 
tgtgctcttc 
tagtcctcaa 
aggaatgcag 
gaatctcatc 
agtccatgaa 
agatttcaga 
taagcagatc 
atttgcagtc 
aagaacacca 
tggcaaacct 
gaaagaatac 
ctatcctgga 
tttcaaaatc 
cgctatgcca 
gaacacggag 
tccagcttcc 
ccttcttctt 
gcttcgcgcc 
ggagctgggt 
gaccctgagg 



ggtccttggg 
gtcctggtgc 
tcccctggag 
accaccatga 
gatggggtgc 
ctttctgagc 
gctgccaaat 
gcctggggta 
gcagtggggt 
gtagatggag 
cgtgtgcgtg 
tctggagcca 
cggaagatga 
gagcacttgc 
aactctgagg 
agtacctccg 
gaagtgggaa 
aacccccgct 
cctcactccc 
ctctcagggc 
cagtacaccg 
gctaacaggc 
cgtatttcac 
gaatctgccc 
acagttgtcg 
attctttcat 
gacactactt 
cgggccacaa 
acaagcaatt 
gacaatggga 
tcccaacacc 
gacttgaaga 
aagaaggcag 
tcccgggagg 
ccgccaagta 



aactgtgcca 
tgcccaccac 
aaggtgtgat 
gcttctcaac 
gaattctggc 
ctcacttctt 
tggtgtccac 
aggtcacggg 
gtcagatcgg 
tgaaacctca 
tccggtcctc 
agagtccacg 
attatgttcc 
ttggacggaa 
aaggcacagc 
aggccagccc 
tacccccaga 
ccagccccag 
tggtccacag 
cggagccaac 
tgcgctatcg 
gtgtgctgat 
agggtgaaag 
ctaccacagc 
ctgcatcttg 
acgccccggc 
ctgccctggt 
acaggagagg 
ctttaaaatc 
aacccgaaaa 
cctctgtgcc 
acaaaatatt 
aggagctgga 
actcgcccat 
gacacggcca 



gcttctctgt 
atccttcgcc 
gcttgtgcca 
aagagctgct 
ttcggcttcc 
tggtcagcag 
agttataatg 
gatagagatc 
tcagtgggac 
cattctgctg 
agatgacagg 
cagatcacgg 
actgacaaga 
gcccggcgag 
catgcatttc 
cctgggtcca 
aagaggaaac 
gggatgtgtg 
cctcggaatc 
cagtctacag 
agagaagggg 
tgagaacctg 
agatggcaaa 
tcctgaaaac 
ggatgcgcta 
tctcaaacca 
ggatggtctg 
ccftgggacct 
tgttgcagcc 
acctgagcct 
tgcttctccc 
ggctaatggt 
tcttcagtcg 
gtcaccctca 
ctcggtggtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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gctcccggca ggactgcagt 
gataagcctg gcttttccct 
gcctctcctg cccaccacgc 
agcttgaatg acaacgactt 
caccccaagg gcgccttcgc 
tccagcgttc tccgcgacag 
cggaggaccc cccattcagg 
agactttctc caccccatgg 
tctccacttt ccaagggcgg 
ccatcacggt ccaccatgtc 
tctgagggag cggaggcttc 
ggaaggcagg cggaggccac 
catttgctca gacacaaacc 
gggcggggac ctcggctgca 
caccccaggg ttccctctca 
gacgaggagg atgagaagcc 
tccaggcagc ccatctcccg 
agcctgcatc ggaaggaacc 
cctcagggca agtactcctc 
gcggacacgg agggtcattc 
cctgctcgtc ctcccgcagc 
cccggccggg ccccagaaca 
ggaccccaga gcagagacgc 
gccgggcggc cccgccccac 
gcgagctcca gagggatgct 
agctacgacg acgacagcac 
gcgcgcgcca aggaggcagc 
ggcgcagggg caggtggcga 
cccagccgac ccggcggccc 
aagtcggagc ctccttccaa 
gaggacgagg aggaggagga 
tcctctgtgc caaagtggcc 
agcctcgcca aggaagagag 
gctcctgtga agcgacctct 
ccttcccgac cgccgcctcg 
tggccgcagt acaccacgcg 
ttgcgccaga ggatgatgca 
ccctcttaca gacaaggtta 
agtaatggaa aaccgaatgg 
gtggaccttg atcgtgggtt 
ggaaatcctc ttcggattaa 
acccccgtgg tgagtcctga 
ctggccaatg cccaagataa 
gaggtcatca aaaaaaccac 
acgcccctgc ctaccactac 
accaggcgtc caacaaccac 
aaacccacca ctcccatccc 
ggcaacctga taatgagctc 
tcaggcttgg agactgacac 
gattatgaat ttgagacgtc 
acaccgaggg tgatcccaga 
ctggctggaa ggaaacgatt 
gccccgtgct ctctgactga 
atccccaatg acctgaagaa 
gtggtggccg tggaaggttg 
ggagatgtgg tcacaggtta 
aagtggtcca ctcaagcttc 
acgaggtatt attttaaagt 
tcggtctcat ttgtcaccga 



gagggcccgg atgccagcgc 
ggccacgcag ccccgcccag 
gtccacccag ggcacctctc 
ggtggactca gacgaagatg 
ccagccccgg ccagccctgt 
aagctctgtg caccccggcg 
ggccgcagag gaagattcca 
gggatcatct cggctgctgc 
gaaggatggt gaggacgccc 
ctcctccgtc tcttctcatc 
tgatggtgaa ag'ccacggtg 
ggcccagacg ctgcgggccc 
ctttgctgcc aacgggaggt 
gccctccagc tccccacagt 
ctctgattcc caccctaagc 
gcttcctgcc accgttgtca 
gggctgggag gacttaagga 
catcccagag aaccccaaat 
cctggcctcc aaggctcagg 
tcccaaagca cagccagggt 
acggtcacag cagcatccca 
gcagccccct cctcccgtcg 
gggtcggtca ccttcccagc 
gtcgcagggc cgctcccact 
ccccacggcc ctccagaacc 
agaagtcgag gcccaggatg 
tgcgtccctt cccaagcacc 
ccacaggtcc cagcgcggac 
ccagtcccgc gcccgggtcc 
gcggcccctg tcctccaagt 
cgcggggttt tttaaaggcg 
ctcttcctcc actcccaggg 
ggagcctgcc atcgcgcttg 
ccccccacct ccaggcagct 
cagcgctgcc accgtgagcc 
cgccccacct ggcgacttct 
tgccagattc cgtaaccctc 
taatggcaga ccaaatgtag 
acagagaatt atcaatggcc 
agtattgaat gcagaaggaa 
actaggagga gatggtcgaa 
cggcctccca ctctttgggc 
gccaattttg agtcttggag 
ccatccccct accactacca 
aaccccgagg cccaccactg 
agtccgaacc actacgcgga 
cacctgtccc cctgggacct 
caatgggatc ccagagtgct 
tgcagtacct acggaagagg 
aaggccacca accaccactg 
ggaaggcgcc atcagttcct 
tgttgctcct tacgtgacgt 
tgcactggat cacttccaag 
gagtgatctg cctccccagc 
ccactcattt gtcattgtgg 
cttggtttac agtgcatcct 
atcagtaact cacttgccca 
gcaagcacaa aatcctcatg 
atcagataat cctctgcttg 



tgccccgaag ggaaggcgta 2160 
gggcgccccc ctcggcttcg 2220 
atcgtccttc cctgcctgcc 2280 
agcgcgctgt gggctccctc 2340 
cccccagccg ccagtccccg 2400 
caaagccagc ctcgccggcg 24 60 
gtgcctcagc cccaccctca 2520 
ccacccagcc acacctgagc 2580 
cagccaccaa ctccaatgcg 2640 
tctcgtccag gacgcaggtc 2700 
acggcgatag ggaagacggc 2760 
ggcctgcctc tggacacttc . 2820 
ctccaagcag gttcagcatt 2880 
cgactgtgcc ctcccgagcc 294 0 
ttagctcagg tatccatgga 3000 
atgaccacgt gccttcctcc 3060 
gaagcccgca gagaggggcc 3120 
ccacaggggc agatacacat 3180 
atgttcaaca gagcacagac 3240 
ccacagaccg ccacgcgtcc 3300 
gtgttcccag aaggatgaca 3360 
ccacgtccca gcaccacccg 3420 
ccaggctctc actgacccag 3480 
cctcctcgga cccttacacg 3540 
aggacgagga tgcccagggc 3600 
tgcgggcccc cgcgcacgcc 3660 
agcaggtgga gtctcccaca 3720 
atgcggcctc ccccgccagg 3780 
ccagcagggc agcgccgggg 384 0 
cccagcagtc ggtctcagcc 3900 
ggaaagaaga ccttctgtct 3960 
gcggcaaaga cgccgatggg 4020 
cccctcgcgg agggagcctg 4080 
cccccagggc ctcccacgtc 4140 
ccgtcgcggg cacccacccc 4200 
ccaccacccc gatgctgtcc 4260 
tctcccgaca gcctgccaga 4320 
aagggaaagt ccttcctggt 4 380 
ctcaaggaac aaagtgggtt 4 4 40 
ggtacctcca agattcacat 4 500 
ccattgtaga tctggaaggg 4560 
aggggcgaca tggcacacct 4 620 
gaaagccgct ggtgggcttg 4 680 
tgcagcccac cactactacg 4740 
ccaccacccg ccgcacgacc 4 800 
caaccaccac caccaccccc 4860 
tggaacggca cgacgatgat 4 920 
acgctgaaga agatgagttc 4 980 
cctacgttat atatgatgaa 5040 
agccttcgac cactgctacc 5100 
ttcctgaaga agaatttgat 5160 
acctaaataa agacccatca 5220 
tggacagcct ggatgaaatc 5280 
atgctccccg caacatcacc 5340 
actgggacaa agccacccca 5400 
atgaagactt catcaggaac 54 60 
ttgagaacct aaagcccaac 5520 
gctacggacc tatcagccct 5580 
ttgtgaggcc cccaggcggt 5640 
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1555 1560 1565 

Pro Pro Thr Thr Thr Met Gin Pro Thr Thr Thr Thr Thr Pro Leu Pro 

1570 1575 1580 

Thr Thr Thr Thr Pro Arg Pro Thr Thr Ala Thr Thr Arg Arg Thr Thr 
1585 1590 1595 1600 

Thr Arg Arg Pro Thr Thr Thr Val Arg Thr Thr Thr Arg Thr Thr Thr 

1605 1610 1615 

Thr Thr Thr Pro Lys Pro Thr Thr Pro He Pro Thr Cys Pro Pro Gly 

1620 1625 1630 

Thr Leu Glu Arg His Asp Asp Asp Gly Asn Leu He Met Ser Ser Asn 

1635 1640 1645 

Gly He Pro Glu Cys Tyr Ala Glu Glu Asp Glu Phe Ser Gly Leu Glu 

1650 1655 1660 

Thr Asp Thr Ala Val Pro Thr Glu Glu Ala Tyr Val He Tyr Asp Glu 
1665 1670 1675 1680 
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Asp Tyr Glu Phe Glu Thr Ser Arg Pro Pro Thr Thr Thr Glu Pro Ser 

1685 1690 1695 

Thr Thr Ala Thr Thr Pro Arg Val lie Pro Glu Glu Gly Ala He Ser 

1700 1705 1710 

Ser Phe Pro Glu Glu Glu Phe Asp Leu Ala Gly Arg Lys Arg Phe Val 

1715 1720 1725 

Ala Pro Tyr Val Thr Tyr Leu Asn Lys Asp Pro Ser Ala Pro Cys Ser 

1730 1735 1740 

Leu Thr Asp Ala Leu Asp His Phe Gin Val Asp Ser Leu Asp Glu lie 
1745 1750 1755 1760 

He Pro Asn Asp Leu Lys Lys Ser Asp Leu Pro Pro Gin His Ala Pro 

1765 1770 . 1775 

Arg Asn He Thr Val Val Ala Val Glu Gly Cys His Ser Phe Val He 

1780 1785 1790 

Val Asp Trp Asp Lys Ala Thr Pro Gly Asp Val Val Thr Gly Tyr Leu 

1795 1800 1805 

Val Tyr Ser Ala Ser Tyr Glu Asp Phe He Arg Asn Lys Trp Ser Thr 

1810 1815 1820 

Gin Ala Ser Ser Val Thr His Leu Pro He Glu Asn Leu Lys Pro Asn 
1825 1830 1835 1840 

Thr Arg Tyr Tyr Phe Lys Val Gin Ala Gin Asn Pro His Gly Tyr Gly 

1845 1850 1855 

Pro He Ser Pro Ser Val Ser Phe Val Thr Glu Ser Asp Asn Pro Leu 

1860 1865 1870 

Leu Val Val Arg Pro Pro Gly Gly Glu Pro He Trp He Pro Phe Ala 

1875 1880 1885 

Phe Lys His Asp Pro Ser Tyr Thr Asp Cys His Gly Arg Gin Tyr Val 

1890 1895 1900 

Lys Arg Thr Trp Tyr Arg Lys Phe Val Gly Val Val Leu Cys Asn Ser 
1905 1910 1915 1920 

Leu Arg Tyr Lys He Tyr Leu Ser Asp Asn Leu Lys Asp Thr Phe Tyr 

1925 1930 1935 

Ser He Gly Asp Ser Trp Gly Arg Gly Glu Asp His Cys Gin Phe Val 

1940 1945 1950 

Asp Ser His Leu Asp Gly Arg Thr Gly Pro Gin Ser Tyr Val Glu Ala 

1955 1960 1965 

Leu Pro Thr He Gin Gly Tyr Tyr Arg Gin Tyr Arg Gin Glu Pro Val 

1970 1975 1980 

Arg Phe Gly Asn He Gly Phe Gly Thr Pro Tyr Tyr Tyr Val Gly Trp 
1985 1990 1995 2000 

Tyr Glu Cys Gly Val Ser He Pro Gly Lys Trp 
2005 2010 



<210> 57 

<211> 1441 

<212> DNA 

<213> Homo sapiens 

<400> 57 

gtcgacccac gcgtccgccc ttacatcctc ctaggacccg gtcggtagtc gtcgccccag 60 

cccgccgggg gcgcagcgcc cgagccgcgg ccctcgagac gggaccgaga gcatcatggg 120 

cagcactgtc ccgcgctccg cctccgtgct gcttctgctg ctgctcctgc gccgggccga 180 

gcagccctgc ggggccgagc tcaccttcga gctgccggac aacgccaagc agtgcttcca 240 

cgaggaggtg gagcagggcg tgaagttctc cctggattac caggtcatca ctggaggcca 300 

ctacgatgtt gactgctatg tagaggaccc ccaggggaac accatctaca gagaaacgaa 360 

gaagcagtac gacagcttca cgtaccgggc tgaagtcaag ggcgtttatc agttttgctt 420 

cagtaatgag ttttccacct tctctcacaa gaccgtctac tttgactttc aagtgggcga 4 80 

tgagcctccc attctcccag acatggggaa cagggtcaca gctctcaccc agatggagtc 54 0 
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cgcctgcgtg 
gctgcgggag 
gtctgttggc 
aagcttcttc 
tgctctaggg 
ctgggtcgcg 
catgcccatg 
aggcatccga 
tagccactgt 
caagtgccca 
atttgtcttg 
tttctttggg 
caccctcggt 
ttggtgagct 
taagtgtgag 



accatccatg 
gcccaggacc 
gagacgattg 
acagaaaaac 
cccctcatgc 
tagcccaggg 
cttgattctt 
ctgcattaag 
gggagggtgg 
ggcctcttgg 
ggtgtcctat 
gttctctgta 
gagcaacagt 
gtgtatttcc 
cgtcaaaaaa 



aggctctgaa 
gggcccgagc 
ccctgttcgt 
gacccatcag 
cccaggctgg 
tggaggcaga 
gcacctcagc 
tgtgcagcgc 
acaggcaatg 
gcagcttagg 
cccatatgga 
cctgaggaaa 
gtcagccatg 
tagaagtaga 
aaaaaaaaaa 



aacggtgatt 
ggaagacctt 
ggtcagcttc 
cagggcagtc 
agcagctctc 
acgatgctgc 
agctgaaggt 
tgaaaagaca 
gttcagtggc 
gccctgcctc 
gaagaaaggg 
ccaggccctg 
caagcaggac 
aaactgtggg 
aaaaaaaaaa 



gactcccaga 
aatagccgag 
agtcaggtgc 
cactcctagc 
ctaggtcaca 
tgtggtagcc 
ctcagagacc 
tttacaacta 
ctggctgttg 
tgtttcatga 
gctctaagtt 
ggtgactttg 
agaatggtga 
aaactgtggc 
aaaaaaaaaa 



cgcattaccg 
tctcttactg 
tactgttgaa 
cccggcatcc 
gcctgctggg 
ctttgccttt 
agtaatcaga 
ggccagggat 
gcaggaactc 
tgcatgggtc 
ctggctcttc 
cagatctgct 
ctgggtgccc 
taataaaaac 
agggcggccg 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1441 



<210> 58 
<211> 217 
<212> PRT 

<213> Homo sapiens 

<400> 58 

Met Gly Ser Thr Val Pro Arg Ser Ala Ser Val Leu Leu Leu Leu Leu 

15 10 15 

Leu Leu Arg Arg Ala Glu Gin Pro Cys Gly Ala Glu Leu Thr Phe Glu 

20 25 30 

Leu Pro Asp Asn Ala Lys Gin Cys Phe His Glu Glu Val Glu Gin Gly 

35 40 45 

Val Lys Phe Ser Leu Asp Tyr Gin Val lie Thr Gly Gly His Tyr Asp 

50 55 60 

Val Asp Cys Tyr Val Glu Asp Pro Gin Gly Asn Thr lie Tyr Arg Glu 
65 70 75 80 

Thr Lys Lys Gin Tyr Asp Ser Phe Thr Tyr Arg Ala Glu Val Lys Gly 

85 90 95 

Val Tyr Gin Phe Cys Phe Ser Asn Glu Phe Ser Thr Phe Ser His Lys 

100 105 110 

Thr Val Tyr Phe Asp Phe Gin Val Gly Asp Glu Pro Pro lie Leu Pro 

115 120 125 

Asp Met Gly Asn Arg Val Thr Ala Leu Thr Gin Met Glu Ser Ala Cys 

130 135 140 

Val Thr lie His Glu Ala Leu Lys Thr Val He Asp Ser Gin Thr His 
145 150 155 160 

Tyr Arg Leu Arg Glu Ala Gin Asp Arg Ala Arg Ala Glu Asp Leu Asn 

165 170 175 

Ser Arg Val Ser Tyr Trp Ser Val Gly Glu Thr He Ala Leu Phe Val 

180 185 190 

Val Ser Phe Ser Gin Val Leu Leu Leu Lys Ser Phe Phe Thr Glu Lys 

195 200 205 

Arg Pro He Ser Arg Ala Val His Ser 
210 215 



<210> 59 

<211> 2316 • 

<212> DNA 

<213> Homo sapiens 



<400> 59 
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ccacgcgtcc 
agcagcacca 
cctgatggtt 
gcttggggca 
atgctcactc 
cctactcagt 
atcatcggac 
tccatctgct 
aggaagcccc 
gatgttatcg 
tctgagcttt 
ttcttgatgg 
tggaagagac 
gcctggatca 
agctccgcct 
tggctgctca 
cctcaactcg 
actcaaggtt 
ctgcagaacc 
ccttacaaag 
aaatgcagcc 
ttctgagaaa 
caattcttcc 
agtatttttt 
actcaagttt 
cccaggcttg 
agcaatcctc 
cagcctaggc 
tcacgctggc 
gctctgtggc 
ccactggcct 
ttgcctgggt 
cacattgggc 
tgtccccaaa 
ctttcagacc 
ctcacttcaa 
gatttaccgc 
tcattgtata 
tggtggtggc 



ggtctccccc 
agttcacggc 
gccgcaatgg 
tcgtcctaga 
tcccgatcct 
ttctcttcct 
tggacgggag 
tctcctgcct 
tttccctgtt 
ctattgaata 
ccgctcctcg 
cgctgacctt 
atggggccca 
ccctgctcat 
tggctgccaa 
caaagcaacg 
tgaagaagag 
ttgaagagac 
agcctcccca 
actatgaagt 
gggcggcaga 
actgtacaag 
atgctggggc 
tttttttgtc 
agacccttac 
agtgcagtgg 
ccatctccat 
ccttaatctt 
tcacacagct 
ccagagcaga 
gaatctacac 
aggagaggct 
ctcagcagct 
cttgctgtca 
tcactagcac 
attcctgggg 
atttacggct 
ataagttatt 
agcaaaaaaa 



agcactgagg 
caacgccttg 
cctgaaatcc 
aacggtggcc 
cgtctgcaag 
cctgggtgtg 
cacagggccc 
gctggctcat 
ggtgattctg 
tattgtcctg 
tcgcaatgaa 
cctcatgtcc 
catctacctc 
gcttcctgac 
tggctgggtg 
aaaccccatg 
ctatggtgtg 
aggggacacg 
aaaggaattc 
aaagaaagag 
tctagcggga 
acactacggg 
tgatgtgggc 
tcatcctttg 
tctttttgtt 
tgcgatcaca 
ctcccaaagt 
gctgttattt 
ctaggggcct 
cctgcatatc 
tggaagccaa 
aaagatcacc 
ccccagcacc 
attccgagat 
aagcccggtt 
ctgatacttc 
gcattctgta 
cacctgagta 
aaaaaaaaaa 



agctcgcctg 
gcactagggt 
aagtactaca 
acagccgggg 
gtgcaggact 
ttgggcatct 
acacgcttct 
gctgtcagtc 
ggtctggccg 
accatgaata 
gactttgtcc 
tccttcacct 
acgatgctcc 
tttgaccgca 
ttcctgttgg 
gattatcctg 
gagaacagag 
ctctatgccc 
tccatcccac 
ggcagctaac 
gctcaaaggg 
aacagtttgc 
tagtaagact 
gatacttctt 
tgttttttga 
gcccagtgca 
gctgggatga 
tccatggact 
gctcctctaa 
tgagcaaaaa 
cttgctggca 
ctaaatttac 
aattcacagg 
ctaatctccc 
gctccttgtc 
tctcatcttg 
agtgggcatg 
tgcaataaag 
aaaaaa 



ctgccctctt 
ccagaatggc 
gactttgtga 
ttgtgacctc 
ccaacaggcg 
ttggcctcac 
tcctctttgg 
tgaccaagct 
tgggcttcag 
ggaccaacgt 
tcctgctcac 
tctgtggttc 
tctccattgc 
ggtgggatga 
cttatgttag 
ttgaggatgc 
cctactctca 
cctattccac 
gggcccacgc 
tctgtcctga 
atgtgggcga 
ctccctccca 
ccagttctta 
ttaagtggga 
aacaggatct 
gcctcgacca 
caggcgtgag 
aaaggtctgg 
ctcacagtgg 
tagcaaaagc 
cccccgctcc 
tcatctctct 
tcacccctct 
cctacgctct 
aggagaattt 
caccccaacc 
gtctcctaat 
atgtggtggc 



gcgcgcggga 
tacaacagtc 
taaggctgaa 
ggtggccttc 
aaaaatgctg 
cttcgccttc 
gatcctcttt 
cgtccggggg 
cctagtccag 
caatgtcttt 
ctacgtcctc 
cttcacgggc 
catctgggtg 
caccatcctc 
tcccgagttt 
tttctgtaaa 
agaggaaatc 
acattttcag 
ttggccgagc 
agagtgggac 
aatctgagtc 
gcctcaacca 
gaggcgctgt 
gtctcaggca 
tgctctgtca 
cctgtgctca 
ccacagctcc 
tcatctgagc 
gttttgtgag 
ctctctcagc 
ccaacccttc 
agtgctgcct 
cttcttgcac 
gccaggaatt 
gtagatcatt 
tctgtaaata 
ggaggagtgt 
cactctttca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2316 



<210> 60 
<211> 357 
<212> PRT 

<213> Homo sapiens 



<400> 60 

Met Ala Thr Thr Val 

1 5 
Tyr Tyr Arg Leu Cys 
20 

Thr Val Ala Thr Ala 
35 

Leu Pro lie Leu Val 
50 

Leu Pro Thr Gin Phe 

65 

Leu Thr Phe Ala Phe 

85 

Arg Phe Phe Leu Phe 



Pro Asp Gly Cys 
Asp Lys 
Gly Val 



Cys Lys 

55 
Leu Phe 

70 

He He 



Ala Glu 

25 
Val Thr 
40 

Val Gin 



Leu Leu 
Gly Leu 
Gly He Leu Phe 



Arg 


Asn 


Gly 


Leu 


Lys 


Ser 


Lys 


10 










15 




Ala 


Trp 


Gly 


He 


Val 


Leu 


Glu 










30 






Ser 


Val 


Ala 


Phe 


Met 


Leu 


Thr 








45 








Asp 


Ser 


Asn 


Arg 


Arg 


Lys 


Met 






60 










Gly 


Val 


Leu 


Gly 


He 


Phe 


Gly 




75 










80 


Asp 


Gly 


Ser 


Thr 


Gly 


Pro 


Thr 


90 










95 




Ser 


He 


Cys 


Phe 


Ser 


Cys 


Leu 
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X Vi 1./ 










1 OR 








110 










Hie: 

nx o 


Al 

AX d 


V ax 


Car 

OCX 


Leu 


Th-r 

1 llX 


Lys 


Leu 


\/a 1 

y ax 


Arg Gly Arg 


Lys 


Pro 






IIS 










1 90 

X Z U 
















L6U 




Leu 


Leu 


V d.x 


T 1 o 

X Jie 


Leu 


oxy 


Leu 


ai a 

AX a 


1 

vax 


\r pKa 
oxy rne 


oer 


Leu 


vai 




X V/ 










J. J o 




















Asp 


Val 


X xc 


Al a 
AX a 


Tip 
X xc 


f^l n 
ox u 


1 yr 


Tip 
X Xc 


Val 
V dX 


Leu 


inr Met 


Asn 


Arg 


Th y- 
L Ox 






















1 R R 
100 








1 CO 

1 bU 


Asn 


V d X 


ria 11 


vax 


r ne 


Ser 


o±U 


Leu 


Ser 


Axa 


rro 


Arg Arg 


Asn 


islU 


Asp 










X DO 










1 / U 








1 f 0 






Val 


T .431 1 
±Js£ u 




Leu 


1 nx 


lyr 


V;* 1 
V dx 


T All 
Xi6U 


Phe 


Leu 


Met Axa 


Leu 


Thy 

1 nr 


PVl A 

trne 








1 ft n 

X o u 










1 P R 
X O D 








1 yu 






Leu 


Met 


OCX 




Pho 








ox y 


Ser 


Phe 


inr (jiy 


Trp 


Lys 


Arg 






1 QS 










9 00 
z u u 
















His 


Gly 


AT A 


nx o 


Tip 
X xe 


lyr 


Leu 


Th-r 
i ilx 




Leu 


Leu 


ber lie 


Ala 


T 1 o 

lie 


Trp 




210 




























Val 


Ala 


irp 


Tip 
J. X6 


1 liX 


Leu 


Leu 


rJe L 


Leu 


Pro Asp 


Phe Asp 


Arg 


Arg 


Trp 


225 










0 "5 n 










235 








z4 U 


Asp Asp 


X IIX 


T 1 <a 
XX6 


Leu 


Ser 


Qor- 
ocx 


Axa 


Leu 


Ala 


Ala 


Asn tiiy 


Trp 


vai 


rne 




















250 








O C Q 

zoo 




Leu 


Leu 




lyr 




C A V 
OCX 


Pro 


oXU 


rrne 


Trp 


Leu 


Leu Thr 


Lys 


Pin 

Cjin 


Arg 


















9 R 
Z DO 








O "7 n 






Asn 


Pro 


lYiei, 


Asp 


Tyr 


Pro 


V a J. 




Asp 


Ala 


Phe 


Cys Lys 


Pro 


bin 


Leu 






^ f D 










9 P 0 
Z O U 








o o c 

zoo 








Val 


Lys 


Lys 


OCL 


lyr 


\j±y 


1 

V cl X 


v?X u 


A c n 


Arg 


Ala 


iyr oer 


vjxn 


Pin 

oXU 


Pin 




290 




















300 








He 


Thr 


wXli 


wX 


Php 


sJX U 


f^l n 

ox LI 


Thr 

1 IIX 


f^l V 

v5X y 


Asp 


Thr 


Leu Tyr 


Ala 


P V r\ 


Tyr 


305 










310 










315 








320 


Ser 


Thr 


His 


Phe 


Gin 


Leu 


Gin 


Asn 


Gin 


Pro 


Pro 


Gin Lys 


Glu 


Phe 


Ser 










325 










330 








335 




He 


Pro 


Arg 


Ala 


His 


Ala 


Trp 


Pro 


Ser 


Pro Tyr 


Lys Asp 


Tyr 


Glu 


Val 








340 










345 








350 






Lys 


Lys 


Glu 


Gly 


Ser 























355 



<210> 61 

<211> 4651 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 2270, 3639, 3724, 4265, 4638, 4644, 4645, 4647, 4649, 4650, 
4651 

<223> n ^ A,T,C or G 
<400> 61 

ttttttagtt ttacctagtt ttatttgtct atttgagtat tgtccttgaa tttaaaattt 60 

ttttcagccc caactgatac acacacatat acatacataa cacatgtgtg tgtgtgtagc 120 

ttacagagtg tttataggaa actgattttg tatactttgg ctactttgtt gtaagttcta 180 

gttttttttc ttttattatt aaactagtgc acgacatcaa tgctatatga ttggtgtttc 240 

gttgacctag aaataatgca tgccatcttc ttttcacagc tgtgtgccaa ccacgatgca 300 

aacatggtga atgtatcggg ccaaacaagt gcaagtgtca tcctggttat gctggaaaaa 360 

ccttctaact cgtgtgaaga cgagcacatc ccagctcqtc ttgaccaagg cagtgaacag 420 

cctcttttcc aacccctgga tcaccaagcc acaagtttgc cttcaaggga tctaaatgag 480 

tgtggcctga agccccggcc ctgtaagcac aggtgcatga acacttacgg cagctacaag 54 0 

tgctactgtc tcaacggata tatgctcatg ccggatggtt cctgctcaag tgccctgacc 600 

tgctccatgg caaactgtca gtatggctgt gatgttgtta aaggacaaat acggtgccag 660 

tgcccatccc ctggcctgca cctggctcct gatgggagga cctgtgtaga tgttgatgaa 720 
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tgtgctacag gaagagcctc ctgccctaga tttaggcaat gtgtcaacac ttttgggagc 780 
tacatctgca agtgtcataa aggcttcgat ctcatgcata ttggaggcaa atatcaatgt 840 
catgacatag acgaatgctc acttggtcag tatcagtgca gcagctttgc tcgatgttat 900 
aacgtacgtg ggtcctacaa gtgcaaatgt aaagaaggat accagggtga tggactgact 960 
tgtgtgtata tcccaaaagt tatgattgaa ccttcaggtc caattcatgt accaaaggga 1020 
aatggtacca ttttaaaggg tgacacagga aataataatt ggattcctga tgttggaagt 1080 
acttggtggc ctccgaagac accatatatt cctcctatca ttaccaacag gcctacttct 1140 
aagccaacaa caagacctac accaaagcca acaccaattc ctactccacc accaccacca 1200 
cccctgccaa cagagctcag aacacctcta ccacctacaa ccccagaaag gccaaccacc 1260 
ggactgacaa ctatagcacc agctgccagt acacctccag gagggattac agttgacaac 1320 
agggtacaga cagaccctca gaaacccaga ggagatgtgt tcattccacg gcaaccttca 1380 
aatgacttgt ttgaaatatt tgaaatagaa agaggagtca gtgcagacga tgaagcaaag 1440 
gatgatccag gtgttctggt acacagttgt aattttgacc atggactttg tggatggatc 1500 
agggagaaag acaatgactt gcactgggaa ccaatcaggg acccagcagg tggacaatat 1560 
ctgacagtgt cggcagccaa agccccaggg ggaaaagctg cacgcttggt gctacctctc 1620 
ggccgcctca tgcattcagg ggacctgtgc ctgtcattca ggcacaaggt gacggggctg 1680 
cactctggca cactccaggt gtttgtgaga aaacacggtg cccacggagc agccctgtgg 1740 
ggaagaaatg gtggccatgg ctggaggcaa acacagatca ccttgcgagg ggctgacatc 1800 
aagagcgtcg tcttcaaagg tgaaaaaagg cgtggtcaca ctggggagat tggattagat 1860 
gatgtgagct tgaaaaaagg ccactgctct gaagaacgct aacaactcca gaactaacaa 1920 
tgaactccta tgttgctcta tcctcttttt ccaattctca tcttctctcc tcttctccct 1980 
tttatcaggc ctaggagaag agtgggtcag tgggtcagaa ggaagtctat ttggtgaccc 2040 
aggtttttct ggcctgcttt tgtgcaatcc caatgaacag tgataccctc cttgaaatac 2100 
aggggcatcg cagacacatc aaagccatct gtgggtgttg ccttccatcc tgtgtctctt 2160 
tcaggaaggc attcagcatg cgtgagccat accatcctcc atcctgatta caaggtgctc 2220 
cttgtagcaa attatgagag tgagttacgg gagcagtttt taaaagaaan tctttkcara 2280 
kggstwtraw gtwwtkkgty cggkgttgkm cccawgrgkr gkwttgrcct tcccttgrra 2340 
wawrawrwac aawagkgctk gkgaaawwra mwatmcccty ttcmytttaa rwwarwtytg 2400 
gccygraccys aamatytkwy ttttaygtgs crkctcmytt twttaaaawa arggtgtgta 24 60 
acatatcaag atacatttat ttttatctgt tttttttttt cctgttaaag acaattatgt 2520 
agagtgggca cgtaatccct ccttagtagt attgtgtttt gtgtaaatgt gctattgata 2580 
ttaagtattt acatgttcca aatatttaca gactctagtt gcaaggtaaa gggcagcttg 2640 
tgatctcaaa aaaatacatg gtgaaatgtc atccagttcc atgaccttat attggcagca 2700 
gtaggaaatt ggcagaagtg ttgggttgtg gtaacggagt gatgaatttt tttttaatgg 2760 
ccttgagttt gatctctgca aaggatagga aacctttagg aagacaagaa actgcagtta 2820 
atttagaact gtcactgttt caagttacac tttaaaacca cagcttttac catcataaca 2880 
tggctctggt aatatgtagg aagctttata aaagttttgg ttgattcaga aaaaggatcc 2940 
tgttgcagag tgagaggaag cataggggga aactccattg gaacagattt tcacacaacg 3000 
ttttaaattg atataagttt aggcagttgt agttcataac ttatgttgct catgttgtgc 3060 
tgtgtcagga tgggatagga agcaagtccc atgcttagag gcatgggatg tgttggaacg 3120 
ggatttacac acactggagg agcagggcaa gttggaattc taagatccat gaacccccaa 3180 
ctgtatttcc tccctgcata ttttaccaat atattaaaaa acaatgtaac ttttaaaagg 3240 
catcattcct gaggtttgtc ttaatttctg attaagtaat cagaatattt tctgctattt 3300 
ttgccaggaa tcacaaagat gattaaaggg ttggaaaaaa agatctatga tggaaaatta 3360 
aaggaactgg gattattgag cctggagaag agaagactga ggggcaaacc attgatggtt 3420 
ttcaagtata tgaagggttg gcacagagag ggtggcgacc agctgttctc catatgccac 3480 
taagaataga acaagaggaa actggcttag actagagtat aagggagcat ttcttggcag 3540 
gggccattgt tagaatactt cataaaaaaa gaagtgtgaa aatctcagta tctctctctc 3600 
tttctaaaaa attagataaa aatttgtcta tttaagatng gttaaagatg ttcttaccca 3660 
aggaaaagta acaaattata gaatttccca aaagatgttt tgatcctact agtagtatgc 3720 
agtngaaaat ctttagaact aaataatttg gacaaggctt aatttaggca tttccctctt 3780 
gacctcctaa tggagaggga ttgaaagggg aagagcccac caaatgctga gctcactgaa 3840 
atatctctcc cttatggcaa tcctagcagt attaaagaaa aaaggaaact atttattcca 3900 
aatgagagta tgatggacag atattttagt atctcagtaa tgtcctagtg tggcggtggt 3960 
tttcaatgtt tcttcatggt aaaggtataa gcctttcatt tgttcaatgg atgatgtttc 4020 
ggattttttt tttttaagag atccttcaag gaacacagtt cagagagatt ttcatcgggt 4080 
gcattctctc tgcttcgtgt gtgacaagtt atcttggctg ctgagaaaga gtgccctgcc 4140 
ccacaccggc agacctttcc ttcacctcat cagtatgatt cagtttctct tatcaattgg 4200 
actctcccag gttccacaga acagtaatat tttttgaaca ataggtacaa tagaaggtct 4260 
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tctgntcatt taacctggta aaggcagggc tggaggggga aaataaatca ttaagccttt 4320 
gagtaacggc agaatatatg gctgtagatc catttttaat ggttcatttc ctttatggtc 4380 
atataactgc acagctgaag atgaaagggg aaaataaatg aaaattttac ttttcgatgc 4440 
caatgataca ttgcactaaa ctgatggaag aagttatcca aagtactgta taacatcttg 4500 
tttattattt aatgttttct aaaataaaaa atgttagtgg ttttccaaat ggcctaataa 4560 
aaacaattat ttgtaaataa aaacactgtt agtaataaaa aaaaaaaaaa aaaaaaaaaa 4620 
aarrrrmmra aimrnnmaancc gccnntngnn n 4 651 



<210> 62 
<211> 560 

<212> PRT 

<213> Homo sapiens 



<400> 62 



Met 


Leu 


Tyr 


Asp 


Trp 


Cys 


Phe 


Val 


1 








5 








Phe 


Phe 


Ser 


Gin 


Leu 


Cys 


Ala 


Asn 








20 










Ser 


Gly 


Gin 


Thr 


Ser 


Ala 


Ser 


Val 






35 










40 


Ser 


Asn 


Ser 


Cys 


Glu 


Asp 


Glu 


His 




50 










55 




Ser 


Glu 


Gin 


Pro 


Leu 


Phe 


Gin 


Pro 


65 










70 






Pro 


Ser 


Arg 


Asp 


Leu 


Asn 


Glu 


Cys 










85 








His 


Arg 


Cys 


Met 


Asn 


Thr 


Tyr 


Gly 








100 










Gly 


Tyr 


Met 


Leu 


Met 


Pro 


Asp 


Gly 






115 










120 


Ser 


Met 


Aia 


Asn 


Cys 


Gin 


Tyr 


Gly 




130 










135 




Arg 


Cys 


Gin 


Cys 


Pro 


Ser 


Pro 


Gly 


145 










150 






Thr 


Cys 


Val 


Asp 


Val 


Asp 


Glu 


Cys 










165 








Arg 


Phe 


Arg 


Gin 


Cys 


Val 


Asn 


Thr 








180 










His 


Lys 


Gly 


Phe 


Asp 


Leu 


Met 


His 






195 










200 


Asp 


He 


Asp 


Glu 


Cys 


Ser 


Leu 


Gly 




210 










215 




Arg 


Cys 


Tyr 


Asn 


Val 


Arg 


Gly 


Ser 


225 










230 






Tyr 


Gin 


Gly 


Asp 


Gly 


Leu 


Thr 


Cys 










245 








Glu 


Pro 


Ser 


Gly 


Pro 


He 


His 


Val 








260 










Lys 


Gly 


Asp 


Thr 


Gly 


Asn 


Asn 


Asn 






275 










280 


Trp 


Trp 


Pro 


Pro 


Lys 


Thr 


Pro 


Tyr 




290 










295 




Pro 


Thr 


Ser 


Lys 


Pro 


Thr 


Thr 


Arg 


305 










310 






Pro 


Thr 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 










325 








Leu 


Pro 


Pro 


Thr 


Thr 


Pro 


Glu 


Arg 








340 










Ala 


Pro 


Ala 


Ala 


Ser 


Thr 


Pro 


Pro 



Asp 


Leu 


Glu 


He 


Met 


His 


Ala 


He 




10 










15 




His 


Asp 


Ala 


Asn 


Met 


Val 


Asn 


Val 


25 










30 






He 


Leu 


Val 


Met 


Leu 


Glu 


Lys 


Pro 










45 








He 


Pro 


Ala 


Pro 


Leu 


Asp 


Gin 


Gly 








60 










Leu 


Asp 


His 


Gin 


Ala 


Thr 


Ser 


Leu 






75 










80 


Gly 


Leu 


Lys 


Pro 


Arg 


Pro 


Cys 


Lys 




90 










95 




Ser 


Tyr 


Lys 


Cys 


Tyr 


Cys 


Leu 


Asn 


105 










110 






Ser 


Cys 


Ser 


Ser 


Ala 


Leu 


Thr 


Cys 










125 








Cys 


Asp 


Val 


Val 


Lys 


Gly 


Gin 


He 








140 










Leu 


His 


Leu 


Ala 


Pro 


Asp 


Gly 


Arg 






155 










160 


Ala 


Thr 


Gly 


Arg 


Ala 


Ser 


Cys 


Pro 




170 










175 




Phe 


Gly 


Ser 


Tyr 


He 


Cys 


Lys 


Cys 


185 










190 








uxy 


oJ.y 


Lys 


Tyr 


oxn 


Cys 


nJLS 










205 








Gin 


Tyr 


Gin 


Cys 


Ser 


Ser 


Phe 


Ala 








220 










Tyr 


Lys 


Cys 


Lys 


Cys 


Lys 


Glu 


Gly 






235 










240 


Val 


Tyr 


He 


Pro 


Lys 


Val 


Met 


He 




250 










255 




Pro 


Lys 


Gly 


Asn 


Gly 


Thr 


He 


Leu 


265 










270 






Trp 


He 


Pro 


Asp 


Val 


Gly 


Ser 


Thr 










285 








He 


Pro 


Pro 


He 


He 


Thr 


Asn 


Arg 








300 










Pro 


Thr 


Pro 


Lys 


Pro 


Thr 


Pro 


He 






315 










320 


Leu 


Pro 


Thr 


Glu 


Leu 


Arg 


Thr 


Pro 




330 










335 




Pro 


Thr 


Thr 


Gly 


Leu 


Thr 


Thr 


He 


345 










350 






Gly 


Gly 


He 


Thr 


Val 


Asp 


Asn 


Arg 
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"3 c; 






360 




"5 c 






vax 


r* 1 n 


Th-r 

i nx 


Asp Pro 


Gin Lys 


Pro 


Arg Gly Asp 


vai rne i±e 


Pro 


Arg 




"5 T n 






375 






oo U 






bin 


Pro 


O £X V" 


Asn Asp 


Leu Phe 


Glu 


He Phe Glu 


lie blu Arg 


Gly 


vai 


"3 Q R 








390 




395 




400 






Asp 


Asp Glu 


Ala Lys 


Asp Asp Pro Gly 


vai lieu Val 


His 


Ser 








405 






410 




/lie 

415 




Cys 


Asn 


rne 


Asp His Gly Leu 


Cys 


Gly Trp He 


Arg biu Lys 


Asp 


Asn 








420 






425 


430 






Asp 


L6U 


nXS 


Trp Glu 


Pro lie 


Arg Asp Pro Ala 


biy biy bin 


Tyr 


Leu 






/} c: 






440 




/ /I c: 
^ 4 □ 






Th-r 
i iiir 


V d J. 


O CX 


Ala Ala 


Lys Ala 


Pro 


Gly Gly Lys 


fv±a HI a Arg 


Leu 


vai 




/! e^n 
^ DU 






455 












L6U 


rro 


Leu 


Gly Arg 


Leu Met 


His 


Ser Gly Asp 


Leu Cys Leu 


Ser 


rne 


4 DO 








470 




475 






480 




nis 


Lys 


Val Thr 


Gly Leu 


His 


Ser Gly Thr 


lieu bin vai 


rne 


vai 








485 






490 




4 95 




Arg 


Lys 


niS 


Gly Ala 


His Gly 


Ala 


Ala Leu Trp 


biy Arg Asn 


biy 


biy 








500 






505 


510 






His 


Gly 


Trp 


Arg Gin Thr Gin 


lie 


Thr Leu Arg 


Gly Ala Asp 


He 


Lys 






515 






520 




525 






Ser 


Val 


Val 


Phe Lys 


Gly Glu 


Lys 


Arg Arg Gly 


His Thr Gly 


Glu 


He 




530 






535 






540 






Gly Leu 


Asp 


Asp Val 


Ser Leu 


Lys 


Lys Gly His 


Cys Ser Glu 


Glu 


Arg 


545 








550 




555 






560 



<210> 63 
<211> 4461 
<212> DNA 

<213> Homo sapiens 
<400> 63 

cgaccccgcg tccgggggca ttgcgtggtg gaaagttgcg tgcggcagag aaccgaaggt 60 

gcagcgccac agcccagggg acggtgtgtc tgggagaaga cgctgcccct gcgtcgggac 120 

ccgccagcgc gcgggcaccg cggggcccgg gacgacgccc cctcctgcgg cgtggactcc 180 

gtcagtggcc caccaagaag gaggaggaat atggaatcca agggggccag ttcctgccgt 240 

ctgctcttct gcctcttgat ctccgccacc gtcttcaggc caggccttgg atggtatact 300 

gtaaattcag catatggaga taccattatc ataccttgcc gacttgacgt acctcagaat 360 

ctcatgtttg gcaaatggaa atatgaaaag cccgatggct ccccagtatt tattgccttc 420 

agatcctcta caaagaaaag tgtgcagtac gacgatgtac cagaatacaa agacagattg 480 

aacctctcag aaaactacac tttgtctatc agtaatgcaa ggatcagtga tgaaaagaga 540 

tttgtgtgca tgctagtaac tgaggacaac gtgtttgagg cacctacaat agtcaaggtg 600 

ttcaagcaac catctaaacc tgaaattgta agcaaagcac tgtttctcga aacagagcag 660 

ctaaaaaagt tgggtgactg catttcagaa gacagttatc cagatggcaa tatcacatgg 720 

tacaggaatg gaaaagtgct acatcccctt gaaggagcgg tggtcataat ttttaaaaag 780 

gaaatggacc cagtgactca gctctatacc atgacttcca ccctggagta caagacaacc 840 

aaggctgaca tacaaatgcc attcacctgc tcggtgacat attatggacc atctggccag 900 

aaaacaattc attctgaaca ggcagtattt gatatttact atcctacaga gcaggtgaca 960 

atacaagtgc tgccaccaaa aaatgccatc aaagaagggg ataacatcac tcttaaatgc 1020 

ttagggaatg gcaaccctcc cccagaggaa tttttgtttt acttaccagg acagcccgaa 1080 

ggaataagaa gctcaaatac ttacacactg atggatgtga ggcgcaatgc aacaggagac 1140 

tacaagtgtt ccctgataga caaaaaaagc atgattgctt caacagccat cacagttcac 1200 

tatttggatt tgtccttaaa cccaagtgga gaagtgacta gacagattgg tgatgcccta 1260 

cccgtgtcat gcacaatatc tgctagcagg aatgcaactg tggtatggat gaaagataac 1320 

atcaggcttc gatctagccc gtcattttct agtcttcatt atcaggatgc tggaaactat 1380 

gtctgcgaaa ctgctctgca ggaggttgaa ggactaaaga aaagagagtc attgactctc 1440 

attgtagaag gcaaacctca aataaaaatg acaaagaaaa ctgatcccag tggactatct 1500 

aaaacaataa tctgccatgt ggaaggtttt ccaaagccag ccattcagtg gacaattact 1560 
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ggcagtggaa 
agtaaaatta 
ctggagagaa 
gcagacgaga 
ggaatcgttg 
atgaagaagt 
aacaaaaagt 
tgtccagaga 
ataaacaaag 
ctgtcacagg 
taagaacaag 
ttccgtaaat 
gtagctattt 
tgtaatgttt 
gattgctatc 
ttgtgaagat 
gaggggaggc 
cagcgaactc 
caaactgaca 
tctacagaca 
gagacagttt 
ctataattac 
tttcatgttt 
ctgtgttatt 
accctacaga 
atcaagcagt 
tttgtaactt 
gaaacttttt 
tttcaagcag 
acacaacttt 
gaaaggaaaa 
caatgaaaaa 
ctgcattatt 
agtgactccg 
ttcttagact 
ctctaaactt 
taaaccaaag 
ttatctctgt 
ttctaaaatt 
ttccatataa 
ctaagatata 
tcttggtttc 
cagaacatga 
gacaacatta 
gcagccacat 
tttcggctct 
actccctata 
ctaaaagata 
aaaaaaaaaa 



gcgtcataaa 
tcatttcccc 
cagtaaactc 
taagtgatga 
ttggtctcct 
caaagactgc 
tagaagaaaa 
taaaaatcat 
acattgacag 
ttttcagaga 
ttttggcagc 
gtctgcactg 
ttatacactg 
ttatttcaat 
agcaatgccc 
tggggacact 
cgagagtcag 
tcagaaataa 
attttaccta 
cctgctaata 
ggaaaaatca 
aaattggtgt 
aaaaagatat 
ttccacagaa 
tattgaatgc 
acatgaaagt 
tataaacatg 
gaagtagacc 
aagcctggtt 
aaagggaaga 
ggaaaactgg 
taaattgaaa 
tagaaacata 
tttaataaaa 
cgatgctgta 
ttattgaagt 
taaaagggga 
aaatactgtg 
tttaaacacc 
atattatttt 
tttgcagaaa 
cacttgcaat 
gaaactcaga 
ttaccatcga 
gcacgaagat 
ccaatttaac 
tgtgattata 
ataaaactga 
gggcggccgc 



ccaaacagag 
tgaagagaat 
cttgaatgtc 
aaacagagaa 
ccttgctgcc 
atcaaaacat 
caatcacaaa 
atagaccaat 
caattcatgg 
attatctcaa 
catgataata 
aggatttctt 
taagctttgt 
tgtttatatg 
caaactttct 
catattgccc 
actgatagac 
atcacagatg 
ttctgaaaaa 
aattattttc 
tggtcaacat 
taaatccttt 
caatcagaat 
tatagaatat 
accttgagat 
gtaataataa 
ttttatgctt 
ggatatgggc 
tctgggagaa 
taaatgaaca 
tgtttgtttc 
attaaagtca 
cgttattgta 
gcttccgtag 
taaaatatta 
tttatttggc 
aaaaccaaag 
atttcttttt 
tgctctccac 
ctcttttggt 
gaagcaacat 
ggttaattaa 
gctctggacc 
ttcagtgcct 
gctaagaaga 
tctttggcaa 
ggaattgttt 
aatatgtctt 
t 



gaatctcctt 
gttacattaa 
tctgctataa 
aaggtgaatg 
cttgttgctg 
gtaaacaagg 
actgaagcct 
tgaagcatga 
ttcaagtatt 
gtaaaacaaa 
ggtcatatgt 
tttggtttgc 
tctgggagtt 
gataatctga 
cataagcacc 
taattaaaaa 
accataggag 
catatagaca 
gacataaaac 
tgtcaaaaga 
tcccattttc 
gggttatcca 
tggagttttt 
atattttttt 
aatttagtgt 
aatgtctatg 
gaggaaattt 
tacttgtgac 
cactgcacag 
tcagatttct 
ttagactcat 
gatgagaata 
catttgtaaa 
tgcattggta 
tgggaaaaaa 
aggaaaaaaa 
ttatttgttt 
tattttctct 
aataaatcac 
gtgggagatc 
gacaatagag 
gtccaaaaac 
gaaagcagaa 
ggataaagag 
aaaagaattc 
caggaaacag 
gtggaaatgg 
caaaaaaaaa 



atattaatgg 
cttgcacagc 
gtattccaga 
accaggcaaa 
gtgtcgtcta 
acctcggtaa 
aagagagaaa 
acgtggattg 
aagcagttca 
tgaaatttaa 
tgtgtttggt 
cttttatgta 
gctgttaatc 
gcaggtacat 
taaaacccaa 
ctgtgatttt 
ccgactcttt 
cacatacata 
agaatttggt 
aaaaacacaa 
atagatcaca 
ctgccttaaa 
aacagtggtc 
cgtgtgtgtt 
ttttaactga 
tatctttagt 
ttaaggtggt 
tagactttta 
cgatttcttt 
aggtatagaa 
gaaataaaaa 
ggaataatac 
ccatttactg 
tggattaaat 
aagaaaatac 
attgaatctt 
tgcatggcta 
ttagaatttt 
aaacactaaa 
aaaggtttaa 
agagttatgc 
agctgtcaga 
agtttgccag 
gaaagcttac 
caaatcctca 
gttttgcaag 
attaacatac 
aaaaaaaaaa 



caggtattat 
agaaaaccaa 
acacgatgag 
actaattgtg 
ctggctgtac 
tatggaagaa 
ctgtcctagt 
tatttaagac 
ttctaccaag 
ttacaaacaa 
tcaatttttt 
aattttttac 
tgatgtataa 
ttctgattct 
aggtggcagc 
tatcacaagg 
gatatgccac 
atggtactcc 
agcacttacc 
gcatgtgtga 
atgtaaatca 
attataccta 
attatcaaag 
tttgttaact 
tacataattt 
tacattcaaa 
agtataaatg 
aactttgctc 
cccaggattt 
ctatgttatt 
attatgaagg 
tttgccactt 
tctgggcaat 
gcataaaata 
gttattttgc 
ggtcaacatt 
agccattctg 
gttaaagaaa 
ataaaattac 
agtctaactt 
tacaattatt 
acctcgagag 
gaaaaaaaaa 
ttgtttaatg 
acttttgagg 
ttcaaggttc 
ccgtctatgc 
aaaaaaaaaa 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4 380 
4440 
4461 



<210> 64 
<211> 583 
<212> PRT 

<213> Homo sapiens 

<400> 64 

Met Glu Ser Lys Gly Ala Ser Ser Cys Arg Leu Leu Phe Cys Leu Leu 

15 10 15 

He Ser Ala Thr Val Phe Arg Pro Gly Leu Gly Trp Tyr Thr Val Asn 



wo 03/000012 



76 



PCT/US02/19773 



20 



Ser 


Ala 


Tyr 


Gly Asp 


Thr 


He 


ixe 
















40 


Gin 


Asn 


Leu 


rJe L. 


irne 


Gly Lys 


Trp 




50 










55 




Pro 


Val 


irne 




Al A 


trne 


Arg 


Ser 


65 










1 u 






Asp Asp 




Pro 


oXU 


lyr 


Lys 


Asp 










oo 








inr 


Leu 


Ser 


T 1 o 

JL xe 


Q A ^ 


Asn 


Ala 


Arg 








J. uu 








Cys 


Met 


Leu 


V ai 


i nr 


Glu 


Asp Asn 






1 1 










120 


Lys 


Val 


rne 


Lys 




Pro 


Ser 


Lys 




130 










135 




Phe 


Leu 


blu 


1 nr 




Gin 


Leu 


Lys 


145 










150 






Asp 


Ser 


Tyr 


Pro 


Asp 


Gly 


Asn 


He 










165 








Leu 


His 


rro 


Leu 


Glu 


Gly Ala 


Val 








180 










Asp 


Pro 


va± 


Thr 


Gin 


Leu 


Tyr 


Thr 






"1 

1 3^0 










200 


Thr 


Thr 


Lys 


Ala 


Asp 


ixe 


Gin 


Met 




210 










215 




Tyr Gly 


Pro 


Ser Gly 


bin 


Lys 


Thr 


225 










o 'a r\ 






Asp 


He 


Tyr 


Tyr 


Pro 


i nr 


Glu 


Gin 










245 








Lys Asn 


Aia 


He 


Lys 


oXU 


Gly Asp 








260 










Asn Gly 


Asn 


Pro 


Pro 


Pro 


Glu 


Glu 
















280 


Pro 


Glu 


Cjiy 


He 


Arg 


Ser 


Ser 


Asn 




290 










295 




Arg Asn 


AX a 


Thr Gly 


Asp 


Tyr 


Lys 


305 










310 






Met 


He 


Axa 


Ser 


Thr 


Ala 


He 


Thr 










325 








Asn 


Pro 


Ser 


Gly Glu 


Val 


Thr 


Arg 








340 










Ser 


Cys 


1 nr 


He 


Ser 


Ala 


Ser 


Arg 






O «J J 










360 


Asp Asn 


T T «a 

i±e 


Arg 


Leu 


Arg 


Ser 


Ser 




370 










375 




Gin Asp 




Gly Asn 


Tyr 


Val 


Cys 


385 










390 






Gly Leu 


Lys 


Lys 


Arg 


Glu 


Ser 


Leu 










405 








Gin 


He 


Lys 


Met 


Thr 


Lys 


Lys 


Thr 








420 










He 


He 


Cys 


His 


Val 


Glu 


Gly 


Phe 






435 










440 


He 


Thr 


Gly 


Ser Gly 


Ser 


Val 


He 




450 










455 




He 


Asn 


Gly 


Arg 


Tyr 


Tyr 


Ser 


Lys 


465 










470 






Val 


Thr 


Leu 


Thr 


Cys 


Thr 


Ala 


Glu 



485 



















1 X€ 


Pro 




Arg 


Leu 


Asp 


vax 


Pro 


















Lys 


Tyr 


1:}X U 


Lys 


Pro 


Asp 


oxy 


oer 








60 










• 

Ser 


1 nr 


Lys 


Lys 


Ser 


vax 


oxn 


Tyr 
















oU 


Arg 


Leu 


Asn 


Leu 


Ser 


UXU 


Asn 


Tyr 




QO 














Tip 


Qor 




Glu Lys 


rix y 


Ph A 

trne 


\7a 1 

vax 


lUD 










Tin 
1 i U 






V d JL 


Pho 


'aX U 


Ala 


Pro 


Thr 


X X e 


vax 










125 








Pro 


Pill 
bXU 


ixe 


Val 


Ser 


Lys 


AXa 


Leu 








140 










Lys 


Leu 


bxy 


Asp Cys 


He 


C i-\ -v- 

oer 


P1 11 

bXU 
















150 


i nr 


Trp 


Tyr 


Arg Asn Gly 


Lys 


Val 




X /U 










175 




vax 


ixe 


T 1 A 

ixe 


Phe 


Lys 


Lys 


Glu 


Met 


1 o D 










190 






K yl _ 4- 


i nr 


ber 


Thr 


Leu 


Glu 


Tyr 


Lys 










205 








ti O 


Pho 

rne 


i n X 


Cys 


Ser 


Val 


Thr 


Tyr 








220 










ixe 


nlS 


Ser 


Glu 


Gin 


Ala 


Val 


Phe 






O "3 c: 










240 


vax 


TV) V- 

1 nr 


ixe 


Gin 


Val 


Leu 


Pro 


Pro 




O c: n 










255 




Asn 


X xe 


Thr 


Leu 


Lys 


Cys 


Leu Gly 


9 c: 










270 






rfie 


Leu 


rne 


Tyr 


Leu 


Pro 


Gly 


Gin 










285 








i nr 


Tyr 


i nr 


Leu 


Met 


Asp 


Val 


Arg 








300 










L»ys 


Ser 


Leu 


He 


Asp 


Lys 


Lys 


Ser 






jXD 










320 


vax 


ms 


Tyr 


Leu 


Asp 


Leu 


Ser 


Leu 














335 




oxn 


1 xe 


(jXy 


Asp 


Ala 


Leu 


Pro 


Val 












350 






c n 


M J.ct 


Thr- 
ill L 


Val 


Val 


Trp 


Met 


Lys 










365 








Pro 


OCX 


Ph£a 


Ser 


Ser 


Leu 


His 


Tyr 








380 










C^ n 

V3X u 


Thr 
1 iix 


riXo 


Leu 


Gin 


Glu 


Val 


Glu 
















400 


Thr 


Leu 


Tie 

xxe 


Val 


Glu 


Gly 


Lys 


Pro 




fi xu 










415 




Asp 


Pro 

fro 


Ser 


Gly Leu 


Ser 


Lys 


Thr 












430 






r i O 


Lys 


Pro 


Ala 


He 


Gin 


Trp 


Thr 










445 








Asn 


Gin 


Thr 


Glu 


Glu 


Ser 


Pro 


Tyr 








4 60 










He 


He 


He 


Ser 


Pro 


Glu 


Glu 


Asn 






475 










480 


Asn 


Gin 


Leu 


Glu 


Arg 


Thr 


Val 


Asn 




4 90 










495 
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Ser 


T oil 




vox 




Ala 


He 


Ser 


T 1 o 

ixe 


Pro. 




n JL o 


Asp 


r* 1 1 1 
OiU 


AX a 


Asp 


Glu 


He 


Ser 

«3 ± «3 


Asp 


Glu 


Asn 


Arg 


Glu 
520 


Lys 


Val 


Asn 


Asp 


Gin 


Ala 


Lys 


Leu 


He 


Val 


Gly 


He 


Val 


Val 


Gly Leu 


Leu 


Leu 


Ala 


Ala 


Leu 


Val 


Ala 


Gly 




"5 r\ 










535 




















Val 


Val 


Tyr 


Trp 


Leu 


Tyr 


Met 


Lys 


Lys 


Ser 


Lys 


Thr 


Ala 


Ser 


Lys 


His 


545 










550 










555 










560 


Val 


Asn 


Lys 


Asp 


Leu 
565. 


Gly 


Asn- 


Met 


Glu 


Glu 
570 


Asn 


Lys 


Lys 


Leu 


Glu 
575 


Glu 


Asn 


Asn 


His 


Lys 
580 


Thr 


Glu 


Ala 





















<210> 65 . . 

<211> 2174 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 1910, 1941 
<223> n = A,T,C or G 

<400> 65 

agcaccacgc gtccgtgaag atggagacca acgagtctac ggagggatcg cggtcgcggt 60 
cgcggtgaga gccgcagctc tggctgcagg cataagggga cgaggaaggt cagctgactt 120 
cctctgctgc gcttttgaca gccatgtcgt gtttgctttc tttcagatct ttagacatac 180 
agcccagctc cgaaggactg gggcccactt cggaaccgtt tccttcttca gatgacagtc 240 
ccaggtcggc cctggcagct gcaaccgcag cagctgcagc ggctgcatca gctgctgcag 300 
ctactgcagc cttcaccact gccaaagcag ctgcattatc tacaaagacc ccagcgccct 360 
gttctgagtt catggagccg tcctctgacc ccagccttct tggggagccc tgtgcgggac 420 
ccggctttac ccacaatata gcccatggga gtcttggctt tgagcccgtc tatgtttcct 480 
gtattgctca ggacacttgc actacaactg accatagttc taatcctggc cctgttccag 540 
gctctagctc tgggcctgtt cttggttcca gctcaggtgc tggccatggc tctggctctg 600 
gctctggtcc tggctgtggc tctgtccctg gctctggctc tggtcctggt cctggctctg 660 
gtcctggtca tggctctggc tctcatcctg gtcctgcctc tgggcctggt ccagacactg 720 
gccctgactc tgagctcagc ccctgtattc ctccagggtt cagaaacctg gtggcagatc 780 
gggtccctaa ctatacctcc tggagtcagc actgcccctg ggagccccag aaacaaccac 84 0 
cttgggaatt tttgcaagtc ttagaaccgg gtgcccgagg actatggaaa cccccagaca 900 
ttaaagggaa gcttatggtt tgctatgaaa ctttgccacg gggccagtgc ctcctctaca 960 
actgggagga agaggtatta aagttttggc ctgctccctt ttcttgaagg ctgccctcag 1020 
tttcttaggg gaggcagtag tttacatgag ggtgggtacc agaagggata ttatagtcat 1080 
tcaacttggg atccacagag agccaccaac cacctggatc aagtcccaag catgcaggat 1140 
ggctctgaga gttttttctt ccgacacgga caccggggac tgctgactat gcaactaaag 1200 
tcacccatgc cctccagcac cacccagaaa gactcgtacc agccaccagg aaacgtctat 1260 
tggccacttc gagggaagcg tgaagccatg ctggagatgc tcctgcagca tcagatctgg 1320 
taagggattg ggtaaagggg aagagggatg gggaggagaa aaattgggtg agaatggcct 1380 
tgacacccct cgggctacat agtaaagagg tgcaggcaga acaggaaccc acaaggaagc 14 40 
tcttcgaggt tgagtctgtg acacaccatg actaccgaat ggagctggca caagcaggga 1500 
ctcctgcccc aacaaaggtg agaacccacc ccccatcccc cgccacttgc accagctggg 1560 
ctctgacagg ctgtggccaa gtaccaagcc cagaggttga gagagagggc tgaaggccag 1620 
gagttactca gtaccctccc tcacagcctc acgactaccg ccaggagcaa cctgagacct 1680 
tctggataca gagggcacca cagctgccgg tgtgtgaggg tgactaggtg ttgggggcag 1740 
agcggggcag gaaaggtagg gcagagttgt tttgttctgg cttggggaga gtgggatcca 1800 
tcctcatcct ggcactcctc cagggtgtca gtaacatcag gacattggac acaccattcc 1860 
ggaagaactg cagcttctca acacccagta cccttgtctc tggggaaacn ttttgcccta 1920 
tgaacctgag aattacccct naccaattgg gagaaaatat cttcccttcc ctgtcccgga 1980 
ggaaggctgg gtggtggagg ggggagaatg actcctttct gaggggtgag gagggaagtg 2040 



wo 03/000012 



78 



PCT/US02/19773 



gggtatggaa tatggaatct atttctgtct gcactagaga ggtcgggagg aagttaattc 2100 
tcactgymct tgaagaggct ttacataaag ggttctctct craaaaaaaa rawaraaaaa 2160 
aaaagggcgg ccgs 2174 

<210> 66 

<211> 287 

<212> PRT 

<213> Homo sapiens 



<400> 66 



Met 


Ser 


Cys 


Leu 


Leu 


Ser 


Phe 


Arg 


Ser 


Leu Asp 


He 


Gin 


Pro 


Ser Ser 


1 








5 










10 








15 


/"•111 




Leu 


biy 


Pro 


Thr 


Ser 


Glu 


Pro 


Phe Pro 


Ser 


Ser 


Asp 


Asp Ser 








20 










25 








30 




Pro 


Arg 


Ser 


Ala 


Leu 


Ala 


Ala 


Ala 


Thr 


Ala Ala 


Ala 


Ala 


Ala 


Ala Ala 






35 










40 








45 






Ser 


Ala 


Ala 


Ala 


Ala 


Thr 


Ala 


Ala 


Phe 


Thr Thr 


Ala 


Lys 


Ala 


Ala Ala 




50 










55 








60 








Leu 


Ser 


Thr 


Lys 


Thr 


Pro 


Ala 


Pro 


Cys 


Ser Glu 


Phe 


Met 


Glu 


Pro Ser 


65 










70 








75 








80 


Ser 


Asp 


Pro 


Ser 


Leu 


Leu 


Gly Glu 


Pro 


Cys Ala 


Gly 


Pro 


Gly 


Phe Thr 










85 










90 








95 


His 


Asn 


He 


Ala 


His 


Gly Ser 


Leu 


Gly 


Phe Glu 


Pro 


Val 


Tyr 


Val Ser 








100 










105 








110 




Cys 


lie 


Ala 


Gin 


Asp 


Thr 


Cys 


Thr 


Thr 


Thr Asp 


His 


Ser 


Ser 


Asn Pro 






115 










120 








125 






Gly 


Pro 


Val 


Pro 


Gly 


Ser 


Ser 


Ser 


Gly 


Pro Val 


Leu 


Gly 


Ser 


Ser Ser 




130 










135 








140 








Gly 


Ala 


Gly 


His 


Gly 


Ser 


Gly Ser 


Gly 


Ser Gly 


Pro Gly Cys 


Gly Ser 


145 










150 








155 








160 


Val 


Pro 


Gly 


Ser 


Gly 


Ser Gly 


Pro 


Gly 


Pro Gly Ser Gly Pro Gly His 










165 










170 








175 


Gly 


Ser 


Gly 


Ser 


His 


Pro Gly 


Pro 


Ala 


Ser Gly 


Pro Gly 


Pro Asp Thr 








180 










185 








190 




Gly 


Pro 


Asp 


Ser 


Glu 


Leu 


Ser 


Pro 


Cys 


He Pro 


Pro 


Gly 


Phe 


Arg Asn 






195 










200 








205 






Leu 


Val 


Ala 


Asp 


Arg 


Val 


Pro 


Asn 


Tyr 


Thr Ser 


Trp 


Ser 


Gin 


His Cys 




210 










215 








220 








Pro 


Trp 


Glu 


Pro 


Gin 


Lys 


Gin 


Pro 


Pro 


Trp Glu 


Phe 


Leu 


Gin 


Val Leu 


225 










230 








235 








240 


Glu 


Pro 


Gly 


Ala 


Arg 


Gly Leu Trp Lys 


Pro Pro Asp 


He 


Lys 


Gly Lys 










245 










250 








255 


Leu 


Met 


Val 


Cys 


Tyr 


Glu 


Thr 


Leu 


Pro 


Arg Gly Gin Cys 


Leu 


Leu Tyr 








260 










265 








270 




Asn 


Trp 


Glu 


Glu 


Glu 


Val 


Leu 


Lys 


Phe 


Trp Pro Ala 


Pro 


Phe 


Ser 






275 










280 








285 







<210> 67 

<211> 4305 

<212> DNA 

<213> Homo sapiens 



<400> 67 

cccttaataa gatttgccac gtacactcga 
acggtggctc tcgctgctcg cgccccctcc 
ctatgaatta tttatcgccg gcctaaaaat 
cggtggacat gggtggggcc ctggggccgg 
gggcagggct gggtccgggg cagggcgagc 



gccatcgcga gtgtccttga gccgcgggtg 60 
tcccgcgggg ggagcctgat gccacgttcc 120 
accccgaact tcacagcccg agtgaccctc 180 
ccctgttgct cacctcgctc ttcggtgcct 240 
agggcatgac ggtggccgtg gtgtttagca 300 
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gctcagggcc 
tacccctgga 
tcacccagat 
acgtggacac 
ccatcctcag 
ccttcctgca 
aagagtacga 
tcctggaggg 
tggtcacgct 
agctcgacgc 
ccgaggcggc 
cgctgggcag 
ccgagagctg 
tgggcgccca 
gtgttcaccc 
tcacctggga 
tggtggtgat 
gcgtcctata 
tggacagtcg 
gccccgaccc 
accacacctt 
gcatcgacat 
tgaccaacgg 
tgtactacaa 
agatcgtaga 
gcaatggcac 
tgatgtttgt 
gccctgtcag 
ctatcggcaa 
agaacccgcg 
tcttcctcgc 
acactgtgtc 
tccgcttcgg 
acatgcacac 
tcaagatggg 
gcaaggacga 
gctacggcat 
tgcagttcct 
gccagaatga 
tcttctacat 
tggtctactg 
ctttcagcag 
ggcaggccag 
aggcagcccg 
gcaccatcga 
cgcggtctgg 
cgggctgggg 
cgggccgccc 
cagcctggga 
ccgagcggcc 
ccggccgccc 
tcggtccgga 
cgcgcccgcg 
cgctgcccgc 
ggcgcttggc 
gcgagcaggc 
cccacctgcc 
gctcctggct 
gcacaggcta 



gccccaggcc 
gatccagccg 
ctgcggcctc 
cgaggcggtg 
catcagcgga 
gctgggcgtg 
ctggagcgcc 
cgtgcgcgcc 
ggagctgggc 
gcccgtgttt 
gcaggccggt 
caccgatgcg 
gcgcctcagc 
cagctactgg 
tgggcccgtc 
gggccgagac 
cgccctcaac 
catgaagtac 
gcacctgacg 
tggcacagga 
cagcagcggg 
cctcaagaag 
caagcatggc 
gcgggcagac 
cttctctgta 
cgtctccccc 
catgtgcctc 
ctacaaccag 
gtccgtgtgg 
gggcaccacc 
cagctacacg 
gggcctcagt 
cacggtgccc 
ccacatggtc 
gaagctggat 
gggctgcaag 
cgccatgcag 
gggggacgga 
gaagaacgag 
gctgctggtg 
gaagctgcgc 
gggcatctac 
cccggacctc 
cgacatggtg 
gaattggggt 
ccccagccca 
accgccagac 
cccgacgccg 
ggcgcggtgg 
cctgtcgccc 
cttcctcccg 
gcagctggcc 
tcacgcttcc 
tgggtgcacc 
gcaggcgcag 
aggggccccc 
attttgctgg 
ctccggggcc 
cagagacagt 



cagttccgtg 
ctcacagttg 
ctgggtgctg 
gcccagatcc 
ggctctgctg 
tccctggagc 
ttcgccgtca 
gtcgccgacg 
ccgggagggc 
gtggcctact 
ctggtggggc 
ccccccgcca 
ctgcgccaga 
cgccagcatg 
agccctgccc 
ttctccttca 
cggcaccgcc 
cctgtgtggc 
gtggccacgc 
ggctgcgtcc 
gacgtggccc 
ctggccagag 
aagcgggtgc 
atggccatcg 
ccctttgtgg 
tcggccttct 
actgtggtgg 
aacctcacca 
ctgctgtggg 
agcaagatca 
gccaacctgg 
gacaagaagt 
aacggcagca 
aagttcaacc 
gccttcatct 
ctggt caeca 
aaggactccc 
gagacacaga 
gtgatgagca 
gccatggggc 
cactcggtgc 
agctgcttca 
acggccagct 
accacggcgg 
ggcggccgcc 
tgcctgccca 
gggggtcgcg 
gggccgcccc 
ccggtgcgga 
gcgcgctgtc 
ctcttcccgg 
cggcgggagg 
ctgcccagct 
ggccccgcct 
tcgatgtgct 
gcctggcagc 
ggggctgtct 
tgggggcctc 
gggggactgg 



cccgcctcac 
gggtcaacac 
cccacgtcca 
ttgacttcat 
tggtcctcac 
agcagctgca 
tcaccagcct 
ccagccacgt 
cgcgcgcgcg 
gctcgcgcga 
ccggccacgt 
ccttccccgt 
aggtgcgcga 
gaaccctgcc 
gggaggcctt 
gccctggtgg 
tctgggagat 
ctcgctacag 
tggaagagag 
ccaacaccgt 
cctacaccaa 
tggtcaaatt 
gcggcgtatg 
gctccctcac 
agacgggcat 
tggagccata 
ccatcaccgt 
gaggcaagaa 
cgctggtctt 
tggttctggt 
ccgccttcat 
ttcagcggcc 
cggagcggaa 
agcgctcggt 
atgatgctgc 
ttgggtctgg 
actggaagcg 
aactggagac 
gcaagctgga 
tggccctgct 
ccaactcatc 
gcggggtgca 
cggcccaggc 
gcgtaagcag 
gtgcgccccc 
cccccgaccc 
cggcgcttgt 
tgtccgacgt 
ccgggcactg 
actacagctc 
agcccccgga 
ccctgctgca 
ccgtggccga 
gcgcccgccc 
tgccgatcta 
acagacagca 
gtcctcacct 
tggggcacag 
acgagatcag 



cccccagagc 
caccaacccc 
cggcattgtc 
ctcctcccag 
ccccaaggag 
ggtgctgttc 
gcacccgggc 
gagttggcgg 
cacgcagcgc 
ggaggccgag 
gtggctggtg 
gggcctcatc 
cggcgtggcc 
agccccggcc 
ctacaggcac 
gtacctggtc 
ggtggggcgc 
tgcctctctg 
gccctttgtc 
gccctgccgc 
gctctgttgt 
ctcctacgac 
gaacggcatg 
catcaatgag 
cagtgtgatg 
tagccctgca 
cttcatgttc 
gtccgggggc 
caacaactca 
ctgggccttc 
gatccaagag 
tcaagatcag 
catccgcagt 
ggaggacgcg 
tgtcctcaac 
caaggtcttt 
ggccatagac 
agtgtggctc 
catcgacaac 
ggtcttcgcc 
ccagctggac 
gagcctcgcc 
cagcgtgctc 
ctccctggac 
accgtccccc 
gcccccagag 
gcgcagggct 
ctcccgagtg 
cgggaggcac 
ctttcctcga 
gctggaggac 
cgcggcctgg 
ggccttcgct 
cgacggccac 
ccgggaggcc 
cgtctgcctg 
tccaccctgt 
gggcaggact 
cagtgtagcc 



ttcctggacc 
agcagcctcc 
tttgaggaca 
acccatgtgc 
ccgggctccg 
aaggtgctgg 
cacgcgctct 
ctgctggacg 
ctgctgcgcc 
gtgctcttcg 
cccaacctgg 
agcgtcgtca 
attctggccc 
ggggactgcc 
ctactgaatg 
cagcccacca 
tgggagcatg 
cagcccgtgg 
atcgtggaga 
aggcagagca 
aagggattct 
ctgtacctgg 
attggggagg 
gaacgctccg 
gtggctcgca 
gtgtgggtga 
gagtacttca 
ccagctttca 
gtgcccatcg 
tttgctgtca 
caatacatcg 
tacccacctt 
aactaccgtg 
ctcaccagcc 
tacatggcag 
gctaccactg 
ctggcgctct 
tcagggatct 
atggcaggcg 
tgggagcacc 
ttcctgctgg 
agcccaccgc 
aagatgctgc 
cgcgccactc 
tgcccgaccc 
ccgagcccca 
ccgcagcccc 
tcgcgccgcc 
ctctcggcct 
gccgaccgat 
ctgccgctgc 
gcccggggct 
cggcccagct 
tcggcctgca 
tgccaggagg 
cacgcccacg 
gccagccacg 
ctggggctgg 
cgtgggacgc 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 
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aaggcttccc gggaccctgc acctggagac 
ttatcagcca ctcaggctcc gagccagctg 
cggcaggcag gattgggctt ttctggcttc 
tgacagatga tgtcttccat ggtcatcagt 
ctgggctttt gctgtcctct tctcacgcag 
cagactcctg aggctctccc ttccctgggg 
gcatggaggc tggaacttgt ggttgaggca 
agagtttctt ctcatcagag cactgggaca 



ggatctccag tctggagtca gaagtgtgag 3900 
gattctctgc ctgccactgt cagggttaag 3960 
taccatgaaa tcctggccat gggaccccag 4020 
gacctcagta gcctcaaatc atggtgaggg 4080 
agttctgcca ggagggtgtg ctgtgggggt 4140 
ctagccagtt actggtcatg gctgctgtgg 4200 
gggccatccc gatccttgct ctacctggct 4260 
ttaaaccaac ctttt 4305 



<210> 68 
<211> 1236 
<212> PRT 

<213> Homo sapiens 



<400> 68 



Met 


biy 


biy 


Ala 


Leu 


biy 


Pro 


TV T 

Ala 


Leu 


Leu 


Leu 


Thr 


Ser 


Leu 


Phe 


Gly 


1 








5 










10 










15 




AX a 


Trp 


TV 1 ?i 

Ala 


Lily 


Leu 


Gly 


Pro 


Gly 


Gin 


Gly 


Glu 


Gin 


Gly Met 


Thr 


Val 








on 










CO 










30 






TV 1 -» 

Aia 


val 


V7-* 1 

Val 


pne 


Ser 


Ser 


Ser 


Gly 


Pro 


Pro 


Gin 


Ala 


Gin 


Phe 


Arg 


Ala 






35 










40 










45 






Arg 


Leu 


Thr 


Pro 


Gin 


Ser 


Phe 


Leu 


Asp 


Leu 


Pro 


Leu 
50 


Glu 


He 


Gin 


Pro 


Leu 


m ^ 

1 nr 


val 


(jly 


Val 


TV mvs. 

Asn 


Thr 


Thr 


Asn 


Pro 


Ser 


Ser 


Leu 


Leu 


Thr 


Gin 


65 










70 










75 










80 


lie 


Cys 


biy 


Leu 


Leu 

OD 


Gly 


Aia 


TV T 

Ala 


His 


Val 


His 


Gly 


He 


Val 


Phe 
95 


Glu 


Asp 


Asn 


vai 


Asp 

inn 
1 u u 


1 nr 


lalU 


TV 1 ^ 

Ala 


TT-* 1 

val 


Ala 

1 UO 


Gin 


lie 


Leu 


Asp 


Phe 
110 


He 


Ser 


Sezr 


oin 


1 nr 

115 


HIS 


vai 


Pro 


ixe 


Leu 

120 


Ser 


He 


Ser 


Gly 


Gly 

125 


Ser 


Ala 


Val 


\7ja 1 

va± 


Leu 


i nr 


Pro 


Lys 


tjjlU 


Pro 


biy 


Ser 


Ala 


fne 


Leu 


Gin 


Leu 


Gly Val 




1 ?n 




















1 fi u 












Leu 


f^l n 
L3J.U 


vaxn 


vjxn 


Leu 


L»in 


va 1 


Leu 


rne 


Lys 


vai 


Leu 


Glu 


Glu 


Tyr 


145 










150 










155 










160 


Asp 


irp 






irne 


Mia 


vaj. 


lie 


i nr 


Ser 


Leu 


nlS 


Pro Gly His Ala 










165 










170 










175 




Leu 


Phe 


Leu 


Glu 
180 


Gly 


Val 


Arg 


Ala 


Val 
185 


Ala 


Asp 


Ala 


Ser 


His 
190 


Val 


Ser 


Trp 


Arg 


Leu 


Leu 


Asp 


Val 


Val 


Thr 


Leu 


Glu 


Leu 


Gly 


Pro Gly Gly Pro 






195 










200 










205 








Arg 


Ala 
210 


Arg 


Thr 


Gin 


Arg 


Leu 
215 


Leu 


Arg 


Gin 


Leu 


Asp 
220 


Ala 


Pro 


Val 


Phe 


Val 


Ala 


Tyr 


Cys 


Ser 


Arg 


Glu 


Glu 


Ala 


Glu 


Val 


Leu 


Phe 


Ala 


Glu 


Ala 


225 










230 










235 










240 


Ala 


Gin 


Ala 


Gly 


Leu 
245 


Val 


Gly 


Pro 


Gly 


His 
250 


Val 


Trp 


Leu 


Val 


Pro 
255 


Asn 


Leu 


Ala 


Leu 


Gly 


Ser 


Thr 


Asp 


Ala 


Pro 


Pro 


Ala 


Thr 


Phe 


Pro 


Val Gly 








260 










265 










270 






Leu 


He 


Ser 


Val 


Val 


Thr 


Glu 


Ser 


Trp 


Arg 


Leu 


Ser 


Leu Arg Gin 


Lys 






275 










280 










285 








Val 


Arg 
290 


Asp 


Gly 


Val 


Ala 


He 
295 


Leu 


Ala 


Leu 


Gly 


Ala 

300 


His 


Ser 


Tyr 


Trp 


Arg 


Gin 


His 


Gly 


Thr 


Leu 


Pro 


Ala 


Pro 


Ala 


Gly 


Asp 


Cys 


Arg 


Val 


His 


305 










310 










.315 










320 


Pro 


Gly 


Pro 


Val 


Ser 
325 


Pro 


Ala 


Arg 


Glu 


Ala 
330 


Phe 


Tyr 


Arg 


His 


Leu 
335 


Leu 


Asn 


Val 


Thr 


Trp 


Glu 


Gly 


Arg 


Asp 


Phe 


Ser 


Phe 


Ser 


Pro Gly Gly Tyr 



340 345 350 
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Leu 


Val 


Gin 


Pro 


Thr 


Met 


Val 


Val 






o c c 
355 










360 


Trp 


^ 1 IS 

C^Xu 


Met 


vax 


Gly Arg Trp Glu 




OTA 










375 




Pro 


vax 


Trp 


Pro 


Arg 


Tyr 


Ser 


A J. a 


385 










390 






Arg 


nlS 


Leu 


1 nr 


Val 


Ala 


Thr 


Leu 










405. 








Glu 


Ser 


Pro 


Asp 


Pro Gly Thr 


Gly 








420 










Cys 


ArQ 


Arg 


bxn 


Ser 


Asn 


His 


Tnr 






435 










A A r\ 

440 


Tyr 


I nr 


Lys 


Leu 


Cys 


Cys 


Lys 


Gly 




450 










455 




Leu 


Ala 


Arg 


Val 


Val 


Lys 


Phe 


Ser 


4 65 










4 70 






Gly 


Lys 


His 


Gly 


Lys 


Arg 


Val 


Arg 






• 




485 








Glu 


Val 


Tyr 


Tyr 


Lys 


Arg 


Ala 


Asp 








500 










Asn 


Glu 


Glu 


Arg 


Ser 


Glu 


He 


Val 






515 










520 


Thr 


Gly 


lie 


Ser 


Val 


Met 


Val 


Ala 




530 










535 




Ser 


Ala 


Pne 


Leu 


Glu 


Pro 


Tyr 


Ser 


54 5 










550 






Val 


Met 


Cys 


Leu 


Thr 


Val 


Val 


Ala 










565 








Phe 


Ser 


Pro 


Val 


Ser 


Tyr 


Asn 


Gin 








580 










Gly 


Gly 


Pro 


Ala 


Phe 


Thr 


He 


Gly 






595 










600 


Leu 


Val 


Phe 


Asn 


Asn 


Ser 


Val 


Pro 




610 










615 




Ser 


Lys 


lie 


Met 


Val 


Leu 


Val 


Trp 


625 










630 






TV T _ 

Ala 


Ser 


Tyr 


Thr 


Ala 


Asn 


Leu 


Ala 










645 








lie 


Asp 


Thr 


Val 


Ser 


Gly 


Leu 


Ser 








660 










Asp 


(jxn 


Tyr 


Pro 


Pro 


Phe 


Arg 


Phe 






675 










boU 




Arg 


Asn 


xxe 


Arg 


Ser 


Asn 


Tyr 




690 










695 




Lys 


rfie 


Asn 


CjXn 


Arg 


Ser 


Val 


GXu 


inc. 










710 






p 1 t» 

vjxy 


Lys 


Leu 


Asp 


Ala 


Phe 


He 


Tyr 










725 








Ala 


Gly 


Lys 


Asp 


Glu Gly Cys 


Lys 








/4 U 










val 


Pne 


Axa 


1 nr 


Thr 


Gly 


Tyr 


Gly 






ICC 

/55 










760 


irp 


Lys 


Arg 


nxa 


He 


Asp 


Leu 


Ala 




770 










775 




Glu 


Thr 


Gin 


Lys 


Leu 


Glu 


Thr 


Val 


785 










790 






Glu 


Lys 


Asn 


Glu 


Val 


Met 


Ser 


Ser 










805 








Gly 


Val 


Phe 


Tyr 


Met 


Leu 


Leu 


Val 



He 


Ala 


Leu 


Asn 


Arg 


His 


Arg 


Leu 










o ^ c 
J d5 








His Gly 


vax 


Leu 


Tyr 


Met 


Lys 


Tyr 








3oU 










Ser 


Leu 


oxn 


Pro 


vax 


vax 


Asp 


ber 






Q R 










;! A A 

4 00 


Glu 


Glu 


Arg 


Pro 


IT ne 


V a± 


xxe 


vax 




410 










/I 1 C 

4X5 




Gly Cys 


vax 


Pro 


Asn 


X nr 


vax 


Pro 


















rne 


ber 


oer 


oxy 


Asp 


I/a 1 

vax 


AX a 


Pro 










4 4 5 








pne 


Cys 


xxe 


Asp 


xxe 


Leu 


Lys 


Lys 








4 oU 










Tyr 


Asp 


Leu 


Tyr 


Leu 


vax 


Thr 


Asn 






4 Id 










yi o A 

4 oU 


Gly 


Val 


Trp 


TV _ _ 

Asn 


bxy 


Met 


I Xe 


Gly 




4 90 










yi A c 
4 95 




Met 


Ala 


xxe 


CaXy 


Ser 


Leu 


Thr 


He 


505 










CIA 

510 






Asp 


Phe 


Ser 


Vax 


Pro 


Pne 


Val 


Glu 










c o c 
525 








Arg 


Ser 


Asn 


vjiy 


rp U, — 

i nr 


vax 


Ser 


Pro 








540 










Pro 


Ala 


vax 


Trp 


vax 


Met 


Met 


Pne 






c c c 

ODD 










560 


He 


Thr 


Val 


Phe 


Met 


Phe 


Glu 


Tyr 




570 










575 




Asn 


Leu 


Thr 


Arg 


Gly 


Lys 


Lys 


Ser 


585 










590 






Lys 


Ser 


Val 


Trp 


Leu 


Leu 


Trp 


Ala 










605 








He 


Glu 


Asn 


Pro 


Arg 


G Ly 


Tnr 


rnr 








620 










Ala 


Phe 


Phe 


Ala 


Val 


He 


Phe 


Leu 






635 










64 0 


Ala 


Phe 


Met 


lie 


Gin 


Glu 


Gin 


Tyr 




650 










655 




Asp 


Lys 


Lys 


Pne 


CaXn 


Arg 


Pro 


bin 


665 










o70 






Gly 


Thr 


vax 


Pro 


Asn 


bxy 


Ser 


fp V» V 

1 nr 










£0 c 

oo 5 








Arg Asp 


wet 


fix 5 


1 nr 


nxs 


Met 


vax 


















Asp Ala 


Leu 


1 nr 


Ser 


Leu 


Lys 


Jul-, 4. 

Met 






/ XD 










T 0 A 


Asp Ala 


riXa 


vax 


Leu 


Asn 


Tyr 


Me t 




730 










"7 "3 C 




Leu 


Val 


i. nr 


xxe 


oxy 


Ser 


VjXy 


Lys 


745 










T C A 

/5U 






He 


Ala 


Met 


oxn 


Lys 


Asp 


Ser 


nXS 










7 65 








Leu 


Leu 


oxn 


rne 


Leu 


vaxy 


Asp 


uxy 








780 










Trp 


Leu 


Ser 


Gly 


He 


Cys 


Gin 


Asn 






795 










800 


Lys 


Leu 


Asp 


He 


Asp 


Asn 


Met 


Ala 




810 










815 




Ala 


Met 


Gly 


Leu 


Ala 


Leu 


Leu 


Val 
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Q 0 R 

o ^: o 








830 






Phe 


Ala 


Trp 


Glu 


His 


Leu 


Val 


Tyr 

OHKJ 


Trp 


Lys 


Leu 


Arg 


His Ser 

O 4 0 


Val 


Pro 


Asn 


Ser 


Ser 


Gin 


Leu 


Asp 


Phe 


Leu 


Leu 


Ala 


Phe 


Ser 


Arg Gly 


He 


Tyr 




OiJ\J 










R R ^ 










OOU 






Ser 


Cys 


Phe 


Ser 


Gly 


Val 


Gin 


Ser 


Leu 


Ala 


Ser 


Pro 


Pro Arg 


Gin 


Ala 


O DZ) 










O / KJ 










0 /o 






o o rt 






Mop 


Leu 


Th-r 

n Q 
o 0 o 


rvxa 


Car* 


Cat* 


a 1 a 


Gin 
890 


Ala 


Ser 


vax Leu 


Lys 

one 

895 


Met 


Xjeu 






r\X<A 


Arg 


Asp 




Va 1 
VaX 


1 nz. 


Thr 


Ala 


v>xy 


vax ber 

y lu 


ber 


ber 


Leu 


Asp 


J\ r rr 

nx. y 

^ XD 


AT 3 


Th r 




Thr 




vjx u 


Asn 


Trp 


PI \7 

ox y 


Pl\7 Pit/ 

Kjzy oXy 

Q 0 Q 


Arg 


Arg 


Ala 


Pro 


Pro 


Pro 


Ser 


Pro 


Cys 


Pro 


Thr 


Pro Arg 


Ser 


Gly Pro 


Ser 


Pro 




930 










y 










y4 u 








Cys 


Leu 




1 ill. 


*r X U 


rlSp 




,f xU 


trx O 


Glu 


Pro 


Ser 


rro inr 


bxy 


Trp 


945 










you 










955 








o r\ 

yoU 


Gly Pro 




Asp 


o±y 


oxy 


rirg 


Ala 


Ala 


Leu 


Val 


Arg 


Arg Ala 


Pro 


GXn 




















970 








975 




Pro 


Pro 


Gly 


Arg 




t ro 


Thr 


Pro 


Gly 

985 


Pro 


Pro 


Leu 


Ser Asp 

990 


vax 


Ser 


Arg 


Val 


Ser 


Arg 


Arg 


Pro 


Ala 


Trp 


Glu 


Ala 


Arg 


Trp 


Pro Val 


Arg 


Thr 






995 










1000 








1005 






Gly 


His 


Cys 


bxy 


Arg 


His 


Leu 


Ser 


Ala 


Ser 


Glu 


Arg 


Pro Leu 


Ser 


Pro 




1010 








1015 








1020 






Ala 


Arg 


Cys 


His 


Tyr 


Ser 


Ser 


Phe 


Pro 


Arg 


Ala 


Asp Arg Ser 


Gly 


Arg 


1025 








1030 








1035 






1040 


Pro 


Phe 


Leu 


Pro 


Leu 


Phe 


Pro 


Glu 


Pro 


Pro 


Glu 


Leu 


Glu Asp 


Leu 


Pro 










1045 








1050 






1055 


Leu 


Leu 


Gly 


Pro 


Glu 


Gin 


Leu Ala Arg 


Arg 


Glu 


Ala 


Leu Leu 


His 


Ala 








1060 








1065 






1070 




Ala 


Trp 


Ala 


Arg 


Gly 


Ser 


Arg 


Pro 


Arg 


His 


Ala 


Ser 


Leu Pro 


Ser 


Ser 






1075 








1080 








1085 






Val 


Ala 


Glu 


Ala 


Phe 


Ala 


Arg 


Pro 


Ser 


Ser 


Leu 


Pro 


Ala Gly Cys 


Thr 




1090 








1095 








1100 






Gly 


Pro 


Ala 


Cys 


Ala 


Arg 


Pro Asp Gly 


His 


Ser 


Ala 


Cys Arg Arg 


Leu 


1105 








1110 








1115 






1120 


Ala 


Gin 


Ala 


Gin 


Ser 


Met 


Cys 


Leu 


Pro 


He 


Tyr 


Arg 


Glu Ala 


Cys 


Gin 










1125 








1130 






1135 


Glu Gly Glu 


Gin 


Ala 


Gly Ala 


Pro 


Ala 


Trp Gin 


His 


Arg Gin 


His 


Val 








1140 








1145 






1150 




Cys 


Leu 


His 


Ala 


His 


Ala 


His 


Leu 


Pro 


Phe Cys 


Trp Gly Ala 


Val 


Cys 






1155 








1160 








1165 




Pro 


His 


Leu 


Pro 


Pro 


Cys 


Ala 


Ser 


His 


Gly Ser 


Trp 


Leu Ser Gly Ala 



1170 1175 1180 



Trp Gly Pro Leu Gly His Arg Gly Arg Thr Leu Gly Leu Gly Thr Gly 
1185 1190 1195 1200 

Tyr Arg Asp Ser Gly Gly Leu Asp Glu He Ser Ser Val Ala Arg Gly 

1205 1210 1215 

Thr Gin Gly Phe Pro Gly Pro Cys Thr Trp Arg Arg He Ser Ser Leu 

1220 1225 1230 

Glu Ser Glu Val 
1235 



<210> 69 

<211> 1725 

<212> DNA 

<213> Homo sapiens 
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<400> 69 

gtcgacccac 

aggactggct 

tccccagcgc 

caactcccgc 

ggccgtcctg 

ggccttcatc 

ctgcgtggtg 

gccgcaggac 

gggcgtgctg 

caaggccaag 

agtgccggtg 

ctccgggcag 

gctgctcctt 

ctccgccaag 

ggctccactc 

accccaggag 

tgagccaggc 

ggccgcctcc 

agtgacaggg 

ctgactttgg 

tctgtctgcc 

cttgctcacc 

ggaagtgcag 

tgggcagtgg 

aggactctct 

ctctgcccct 

tcacccttgg 

taagaagagc 

acaggaggaa 



gcgtccggct 
ttatctcctg 
ttggaatcct 
ggacgctgaa 
ggctggctgg 
ggcagcaaca 
cagagcaccg 
ctgcaggcgg 
ctgtccgtgg 
accatgatcg 
tcctggacgg 
aagcgggaga 
ggcggggggc 
tattctgctg 
tgttcctctc 
ggccctgcca 
aggaaggcag 
tgctagcaag 
tgtggtggtg 
gatctgcctg 
tgcatctcct 
gaaacccgcg 
agtggatgga 
tgggggaggg 
gcctcacccg 
tccaaggaca 
aagtcctggg 
tattcatcac 
aaaaaaaaaa 



ggaaggaact 
actcacggtg 
acggccccca 

caatggcctc 
ccgtcatgct 
ttgtcacctc 
gccagatgca 
cccgcgccct 
tggggggcaa 
tggcgggcgt 
cccacaacat 
tgggtgcctc 
tgctttgctg 
cccgctctgc 
tgctttgttc 
cgggccactg 
cagccttcag 
aacagagtcc 
gagtggggag 
catcggtgtt 
ctgttgcggg 
cccaggagta 
cgggtttaga 
ggccagagag 
cttcagccca 
ctaatgagcc 
gtttttcctc 
tgtaattatt 
aaaaaaaaaa 



ggtctgctca 
caaaggtgca 
cagccggatc 

catggggcta 
gtgctgcgcg 
gcagaccatc 
gtgcaaggtg 
cgtcatcatc 
gtgtaccaac 
ggtgttcctg 
catccaagac 
gctctacgtc 
caactgtcca 
tgctgccagc 
ttccctggac 
gctgctgggg 
cctctctggc 
accctcctct 
ctggcttctg 
ggccactgtc 
taggccttga 
tggctgaggc 
ggggaggggc 
gcggctcagg 
gggcccctgg 
tgggagggtg 
ttccttcttt 
attattttct 
aaaaagggcg 



cacttgctgg 
ctctgcgaac 
ccctcagcct 

caggtaatgg 
ctgcccatgt 
tgggagggcc 
tacgactcgc 
agcatcatcg 
tgcctggagg 
ttggccggcc 
ttctacaatc 
ggctgggccg 
ccccgcacag 
aactacgtgt 
tgagctcagc 
actggggact 
ccactcggac 
ggatattggg 
ctggccagga 
cccatttaca 
tatcacctct 
cttgcccacc 
gaaggtgctg 
ttgcccagct 
agactgatcc 
gcagggagga 
gtggtttctg 
acaataaatg 
gccgc 



cttgcgcatc 
gttaagtccg 
tccaggtcct 

gcatcgcgct 
ggcgcgtgac 
tatggatgaa 
tgctggcact 
tggctgctct 
atgaaagcgc 
ttatggtgat 
cgctggtggc 
cctccggcct 
acaagcctta 
aaggtgccac 
gcaggctgtg 
gggcagagac 
aacttcccaa 
gagggacgga 
tggcttaacc 
ttttccccac 
gggactgtgc 
cacctgcctg 
taaacaggtt 
ctgtggcctc 
cctctgagtc 
ggggacagct 
ttttgtaatt 
ggacctgtgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1725 



<210> 70 
<211> 206 
<212> PRT 

<213> Homo sapiens 



<400> 70 



Met 


Gly 


Leu 


Gin 


Val 


Met 


Gly 


He 


Ala 


Leu 


Ala 


Val 


Leu 


Gly 


Trp 


Leu 


1 








5 










10 










15 




Ala 


Val 


Met 


Leu 


Cys 


Cys 


Ala 


Leu 


Pro 


Met 


Trp 


Arg 


Val 


Thr 


Ala 


Phe 








20 










25 










30 






He 


Gly 


Ser 


Asn 


He 


Val 


Thr 


Ser 


Gin 


Thr 


He 


Trp 


Glu Gly Leu 


Trp 






35 










40 










45 








Met 


Asn 


Cys 


Val 


Val 


Gin 


Ser 


Thr 


Gly 


Gin 


Met 


Gin 


Cys 


Lys 


Val 


Tyr 




50 










55 










60 










Asp 


Ser 


Leu 


Leu 


Ala 


Leu 


Pro 


Gin 


Asp 


Leu 


Gin 


Ala 


Ala 


Arg 


Ala 


Leu 


65 










70 










75 








80 


Val 


He 


He 


Ser 


He 


He 


Val 


Ala 


Ala 


Leu 


Gly 


Val 


Leu 


Leu 


Ser 


Val 










85 










90 










95 




Val 


Gly 


Gly 


Lys 


Cys 


Thr 


Asn 


Cys 


Leu 


Glu 


Asp 


Glu 


Ser 


Ala 


Lys 


Ala 








100 










105 










110 






Lys 


Thr 


Met 


He 


Val 


Ala 


Gly 


Val 


Val 


Phe 


Leu 


Leu 


Ala 


Gly 


Leu 


Met 






115 










120 










125 








Val 


He 


Val 


Pro 


Val 


Ser 


Trp 


Thr 


Ala 


His 


Asn 


He 


He 


Gin 


Asp 


Phe 




130 










135 










140 








Tyr 


Asn 


Pro 


Leu 


Val 


Ala 


Ser 


Gly 


Gin 


Lys 


Arg 


Glu 


Met 


Gly Ala 


Ser 


145 










150 










155 










160 


Leu 


Tyr 


Val 


Gly 


Trp 


Ala 


Ala 


Ser 


Gly 


Leu 


Leu 


Leu 


Leu 


Gly Gly Gly 
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165 170 175 . 

Leu Leu Cys Cys Asn Cys Pro Pro Arg Thr Asp Lys Pro Tyr Ser Ala 

180 185 190 

Lys Tyr Ser Ala Ala Arg Ser Ala Ala Ala Ser Asn Tyr Val 
195 200 205 



<210> 71 
<211> 5410 
<212> DNA 

<213> Homo sapiens 
<400> 71 

gtcgacccac gcgtccggct accgccgcgt tctattctcc gaagccggcg accgccccac 60 

ctcctccctc cctcccgccc gcttcctctg cccacagcgc cggccagagc gagctagaca 120 

agggcacgcg gggcctcgcc tagacccgag aagactgcgg gcgcgcgcaa gcggcggcgt 180 

ggaagctgtg agcgccccca tcccggaggt ctccgccggc tcccgggtga atcagctccc 240 

ggccgacttt aggattcttc tggattttaa attttttctt tttaaaaaaa cttggacgga 300 

taaaagatgt gccatggcag gatagcacca aagagcacct cagtgtttgc cgtggcctcc 360 

gtgggacatg gagtgttcct tccgctagtg atcctttgca ccctgcttgg agacggactt 420 

gcttccgtgt gccccctacc accggagcca gagaatggtg gctacatctg ccacccccgg 480 

ccctgcagag accccctgac agcaggcagt gtcatcgaat acctgtgtgc tgaaggctac 540 

atgttgaagg gcgattacaa atacctgacg tgtaagaatg gcgagtggaa accagccatg 600 

gagattagct gccgtctcaa cgaggataaa gacacccaca catcacttgg ggtccccacg 660 

ctgtctatag tggcttctac tgccagctcc gtggcgctca ttctcctcct cgtggtgctg 720 

tttgtgctgc tgcagccaaa gctgaagtct ttccatcata gcaggcgtga ccagggggta 780 

tctggggacc aggtctccat catggtggat ggagtccagg ttgcactacc atcatacgag 840 

gaggctgtat atggcagttc tggtcactgt gtgccacctg ctgaccccag agtacagatt 900 

gtgctgtcag aagggtctgg gcccagtggg aggagcgtgc caagggagca acagctgccg 960 

gaccaagggg cctgctcctc tgcaggtgga gaagatgagg ccccaggcca gtctggacta 1020 

tgtgaagcct ggggctctcg ggcctcagag actgtgatgg tgcatcaggc aaccacctct 1080 

tcctgggtgg ccggctcagg gaaccgccaa ctggcacaca aagaaactgc agattcagag 1140 

aacagtgaca tacaaagcct tttatccctc acgtcagagg agtacacaga tgatattcca 1200 

ctgttgaaag aagcatgagg gcagcggcca gcctttcctc tctgcgaggt tctctcagcc 1260 

cttcctccct ctccctgtgg gattgagcac cctgtactct ccagccacct tacctggata 1320 

cctgagctgc cacctgtgta tctgtgtatc tctgagggcc ctataggccc accttgctgg 1380 

aaactcaagg aagattctcg ccatctgcct gttggacagc tggaggagct ggctctttgc 1440 

ctggccccgc cttcccatct gtcagagaca tatttgaatg tgctggatca aaccctccct 1500 

tttcctaagc ctctgggtcc cctccagcca gctctttggc ggcagccccc accagctcct 1560 

gtgggcctga gtgctgctgt gtttacttgt gcctttcccc caccctgtcc agtttccctg 1620 

tcatgcagac ttgttgctgt ccacaagcct tagtggctgc actgctgccc cctgccacac 1680 

sgggggccgg gcctgggtct gtcctgtttc ctttgagggt tgcccctact gccctttgca 174 0 

ggaacagatc caggtgtgag agctcttgag tcaagagtgg cagaagtggc tctaattggg 1800 

gtgagagtgt agtccctggg cttgccctgg gttgaccctg gtggcatatt tccttggctg 1860 

aggatggaag atttggagaa tcatgtccat gctggcccag gacccagcca tctggcccaa 1920 

aggcacaagc tcctggccct gttgagttga gagtttccaa gaagcatcca gaagatccca 1980 

agggagagaa ggaaaatggc tgataatgat tgtcttccta atatgcaagt tctcacttcc 204 0 

tacttccagc atcggccttc ctggccttgt cttttttttg tttccctgga gtataatggg 2100 

aagttgcatg ctgcctcctg ggttttatcc cagatagctc tggctttctt gctgcccaca 2160 

ggggcctggg gcaggaagga gacttgctga gatgccatgg agtgcccatc tggtcactgg 2220 

cagtctgggc aggttgcccc tttctgggtt tgtggtgacg gaggggaggc cgagaggcac 2280 

agaccaagtc cccgggtggc tgcaggcagc tccagcccgg tcctgaggat cctcctcacc 2340 

atggtcacgt gccttagtaa ctgtgcccag gaagtggcct gctgcttgct gtgctgctgc 2400 

ttttcctact tctgcccttc cctgccaccc ctcgcatgtc acagctgaca agcaattcct 2460 

tgtcttccct ggccccctgg gggaagggct gagaaacagt ccgtgtgcac cccaacctta 2520 

atggcctgag gtgggcagag gggtgtggag cagcctggag tacagggccc tgggggagga 2580 

gcccactgat gaggggcgct ctcccatagc catgtgttga atgctaacta ggctggggtg 2640 

gacgaactct gccaactgct gtcatcttag aagatagatg cagcagtaag gaatgtttgt 2700 

tttgcttttt tctgaaattt tctgaagcac tgtggctggg aaacttcgaa gcggaccctg 2760 
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tgctgcatgt ctgctcctcc cctgagcctg 
cagtttattt tcagtacctt acctaacagg 
tgaggggagt ggtcttctcc cagcctttta 
acgcacttta ccacccggtc attccctggc 
tcctctcagg gtaagggcag tgcctgctgt 
'cccaggagcc ctcatctgct gtgctgagtc 
tggagaaggt gctagaacta gaaggcagat 
tccctaatgg ccccagtcgc cttacctcac 
cagaaagcag tgacaaaagg gggaggggtg 
gcttttattt ttcactgttt caaaacccgc 
agatcttacc tttttctatc ttgttactct 
tttgtttggg gtttattcag ctgcatagat 
ccttccttct cctgaagcct ctttccgccc 
agcccaggct tgcccacctg gtaggaaagg 
ccctctcgtc tcctcccctc cttcttgtca 
ctcctgggcc tgtgagtgca ggagctcatt 
tcttcctcca gttatgtctt tcttccaaag 
atttctaggg cttctgggct ttgtccctta 
gaggtagggt ctggctgaga gacaagggta 
ctgcctgagg agcctggagc aggtggaaac 
taccgggctc acctctaggg cattccagca 
aaaggacatt taaaagagtt tttgggtttt 
tttttttttt ttttttggca cacttgagct 
ttgcagtcac attctaatga ctttcaaggg 
ccgtcccttc catgccttag tttagcaggt 
atgtgctgtg ttcccgtttc actgggtatg 
ttatgagttt gtttggtttt ataaggcatt 
catttgcaaa cagatcttaa gctaatattt 
tcccgccagc tttaaagttc agtggagaag 
ttcctgcttc atgggtggtg gcacgggaat 
ccagctgagc cctcagacca cttgcttccc 
acatccttgc gtagagaatg aaatatgctg 
ccaaagaaaa aaaaaaggcc tagcagggaa 
aggacagcga gtgaggctgg gagcttctct 
tagggacagt ccttactggt gccccaaggt 
gtttggtctg aatggtgtag ttgctggttc 
ttgctgggaa atggctctta atttccagtt 
tcaaggttga gcccctctgc caagcagcag 
actcagttgc caagcaagga ggagagatgt 
gccctggctt cttcaggtta tcgcaccact 
aatggtttaa aacaacacat tcataattga 
ggtttgcttt attttatttt atatatatat 
tgaagatagt attttaatat ttgtacaaag 
taactgcatt tctaaataat taaaaaaaag 
gggcggccgc 



tctgcttggg ggtggtaaaa ataaaaatcc 2820 
gttggctcca ggcgtgggtg gcctagaaga 2880 
ccctcttgcc tcctgcctcc gcgcttacac 2940 
ctcttgctgc cacttgtagt cttccttcct 3000 
gcctgttggc cactcccaca cttcccctcc 3060 
caggaaagca tagttaggta gggagctggt 3120 
gagactagca tgggcccacc tggagggctg 3180 
ccacagcagt gcccttgtct tcctccaaaa 3240 
gtaatctgaa gtctcactgc tgagccttca 3300 
attctattct agaatggttt ttaaaatgga 3360 
ggggttttgt ccccctaaga gattgcactt 3420 
gaccagcttg atccctggtg aaatgaaaag 3480 
tgccctccac taacaacact gaggagcaca 3540 
aagaaattag aacaatggga gccttggctc 3600 
ctggctttga tgaggcccac ttcccagagg 3660 
ctcccctcac tgctgaagtc tgtgacagct 3720 
caatttctta accatcagcc atgtgctgct 3780 
ctgagagatt agggactcca cagctgcctt 3840 
gcagcaggtg gcaggctgtt aaaagacagg 3900 
aggtggaaga aaccggccac agccctgctt 3960 
agaggctgat gcaggagaat ggccagcacc 4020 
tttgtttgtt tgttgttggt gtttgttttt 4080 
gactcagtgc aggtttaata tcctggtgac 4140 
ccagaatatg gtgaaaatca cttaaaatat 4200 
aggctctatc ttttgccatt tctgtatttt 4260 
aactgtgaaa tcgactgaat cctggccact 4320 
tcaatgtaca ttctataaat acaagcactc 4380 
tctttcccat tcatcttgcc ctccccctcc 4440 
ccagatggca attcagacaa aggtatactc 4500 
agatagccct tagccctttc cctcccagtc 4560 
acataacaat gtcgcctcca tttccgagga 4 620 
caatcatttc tgcatcctta ctcctcaccc 4680 
gcagcatgca ggcttcacag cttaatgcca 4740 
tgggcctgct gggtctgtca gctctcggaa 4800 
gggacttgga gaatattttg cttggcatat 4860 
cctagagagg aaaaggtggc aggcccagct 4 920 
gaaaccctag tagaattgtg aatgaaaacc 4 980 
agctagtaga aggggatgca ggggcaaagc 504 0 
acgtgggctg tgtggcagtc cccacaccct 5100 
atggaatcct ttgcagaatg gtactcatat 5160 
ctctgtgcag gatgtcactc aatcagtttg 5220 
tttttggtat cctgtacatt gcagtgggtg 5280 
tttaatttaa ttttaattgt tctatgtata 5340 
ttcttatgaa aaaaaaaaaa aaaaaaaaaa 5400 

5410 



<210> 72 

<211> 303 

<212> PRT 

<213> Homo sapiens 



<400> 72 
Met Cys His Gly 
1 

Ala Ser Val Gly 
20 

Leu Leu Gly Asp 
35 

Glu Asn Gly Gly 



Arg lie Ala Pro 
5 

His Gly Val Phe 

Gly Leu Ala Ser 
40 

Tyr lie Cys His 



Lys Ser Thr Ser 
10 

Leu Pro Leu Val 

25 

Val Cys Pro Leu 
Pro Arg Pro Cys 



Val Phe Ala Val 
15 

lie Leu Cys Thr 
30 

Pro Pro Glu Pro 
45 

Arg Asp Pro Leu 
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50 






Thr 


Ala 


biy 


ber 


65 








Lys 


Giy 


Asp 


Tyr 


AX a 


Me L 


blU 


T 1 £1 

lie 








100 


Ser 


Leu 


Gly 


Val 






115 




vaj. 


Ala 


Leu 


lie 




i JU 






Lys 


Leu 


Lys 


Ser 


14 D 








Asp 


bin 


Val 


Ser 


Tyr 


blU 


blU 


Ala 








180 


Asp 


Pro 


Arg 


Val 






195 




Arg 


ber 


vai 


Fro 




210 






ser 


Ala 


Gly 


Gly 


o o c 

225 








Ala 


Trp 


Gly 


Ser 


Thr 


Ser 


Ser 


Trp 








260 


Glu 


Thr 


Ala 


Asp 






275 




Thr 


Ser 


Glu 


Glu 




290 











55 




Val 


lie 


Glu 


Tyr 




70 






Lys 


Tyr 


Leu 


Thr 










Ser 


Cys 


Arg 


Leu 


Pro 


inr 


Leu 


Ser 








120 


Leu 


Leu 


Leu 


Val 






135 




Phe 


His 


His 


Ser 




150 






T 1 A 

lie 


Met 


Val 


Asp 


165 








val 


Tyr Gly 


Ser 


bin 


He 


Val 


Leu 








200 


Arg 


Glu 


Gin 


Gin 






215 




Glu 


Asp 


Glu 


Ala 




230 






Arg 


Ala 


Ser 


Glu 


245 








Val 


Ala 


Gly 


Ser 


Ser 


Glu 


Asn 


Ser 








280 


Tyr 


Thr Asp 


Asp 






295 





60 

Leu Cys Ala Glu 
75 

Cys Lys Asn Gly 
90 

Asn Glu Asp Lys 
105 

He Val Ala Ser 

Val Leu Phe Val 

140 

Arg Arg Asp Gin 
155 

Gly Val Gin Val 
170 

Ser Gly His Cys 
185 

Ser Glu Gly Ser 

Leu Pro Asp Gin 
220 

Pro Gly Gin Ser 
235 

Thr Val Met Val 
250 

Gly Asn Arg Gin 
265 

Asp He Gin Ser 

He Pro Leu Leu 
300 



Gly Tyr Met Leu 
80 

Glu Trp Lys Pro 
95 

Asp Thr His Thr 
110 

Thr Ala Ser Ser 
125 

Leu Leu Gin Pro 

Gly Val Ser Gly 
160 

Ala Leu Pro Ser 
175 

Val Pro Pro Ala 
190 

Gly Pro Ser Gly 
205 

Gly Ala Cys Ser 

Gly Leu Cys Glu 
240 

His Gin Ala Thr 
255 

Leu Ala His Lys 
270 

Leu Leu Ser Leu 
285 

Lys Glu Ala 



<210> 73 

<211> 4392 

<212> DNA 

<213> Homo sapiens 

<400> 73 

gtcgacccac gcgtccgggc cgtccaggct agcggcggcc cgcaggcggc ggggagaaag 60 

actctctcac ctggtcttgc ggctgtggcc accgccggcc aggggtgtgg agggcgtgct 120 

gccggagacg tccgccgggc tctgcagttc cgccgggggt cgggcagcta tggagccgcg 180 

gcccacggcg ccctcctccg gcgccccggg actggccggg gtcggggaga cgccgtcagc 24 0 

cgctgcgctg gccgcagcca gggtggaact gcccggcacg gctgtgccct cggtgccgga 300 

ggatgctgcg cccgcgagcc gggacggcgg cggggtccgc gatgagggcc ccgcggcggc 360 

cggggacggg ctgggcagac ccttggggcc caccccgagc cagagccgtt tccaggtgga 4 20 

cctggtttcc gagaacgccg ggcgggccgc tgctgcggcg gcggcggcgg cggcggcagc 480 

ggcggcggct ggtgctgggg cgggggccaa gcagaccccc gcggacgggg aagccagcgg 54 0 

cgagagcgag ccggctaaag gcagcgagga agccaagggc cgcttccgcg tgaacttcgt 600 

ggacccagct gcctcctcgt cggctgaaga cagcctgtca gatgctgccg gggtcggagt 660 

cgacgggccc aacgtgagct tccagaacgg cggggacacg gtgctgagcg agggcagcag 720 

cctgcactcc ggcggcggcg gcggcagtgg gcaccaccag cactactatt atgataccca 780 

caccaacacc tactacctgc gcaccttcgg ccacaacacc atggacgctg tgcccaggat 84 0 

cgatcactac cggcacacag ccgcgcagct gggcgagaag ctgctccggc ctagcctggc 900 

ggagctccac gacgagctgg aaaaggaacc ttttgaggat ggctttgcaa atggggaaga 960 

aagtactcca accagagatg ctgtggtcac gtatactgca gaaagtaaag gagtcgtgaa 1020 

gtttggctgg atcaagggtg tattagtacg ttgtatgtta aacatttggg gtgtgatgct 1080 

tttcattaga ttgtcatgga ttgtgggtca agctggaata ggtctatcag tccttgtaat 1140 

aatgatggcc actgttgtga caactatcac aggattgtct acttcagcaa tagcaactaa 1200 
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tggatttgta 
tggtggtgca 
ggttggattt 
tgaaatcaat 
agtagctgga 
tgctattggt 
gttttttggt 
agagactttc 
agcaaatatc 
agccatttta 
tgttcgagat 
ttctgcagcc 
tggcctaatg 
tgcaggtata 
aatatttcag 
ttatgggaaa 
attcatctta 
atcatatgca 
gcgtcctgca 
catagtaatg 
gctgtatatt 
cctgacttac 
gaaaaacttt 
acttcatctt 
tatgggtcct 
atggcttatt 
agaaggtgca 
tgtccttgga 
aaacttattt 
aggtctggat 
tcctggcacc 
ttccaaacca 
tgcaactcaa 
tgaccaaaag 
tgatgtctgg 
gaccaagaaa 
aatagaccat 
tgatatcatg 
tgaggaaatc 
taaaatgaaa 
caagacatac 
tattattgtc 
ggcatggtta 
tcagagtgtc 
gtacttcagt 
gaatggtgac 
ttgataacgt 
ttcttctctg 
aaaagcgtgt 
ccttctgaaa 
aamwwwrininw 
gcttgtcttc 
aaaggtatct 
gggaaatggt 



agaggaggag 
attggtctaa 
gcagaaaccg 
gatatccgaa 
atggagtggg 
gatttcgtca 
tataaatctg 
ttttctgtat 
tcaggtgatc 
attactacat 
gccactggaa 
tgcaaattaa 
aacaacttcc 
ttttcagcca 
gctctatgta 
aataatgaac 
attgctgaac 
ttgatcaatt 
ttcaaatact 
ttcgtcatta 
tatgttacct 
ctgaatgcac 
aggccacagt 
gttcatgatt 
cgaagacaag 
aagaacaaaa 
cagtatttga 
tttaagaaag 
catgatgctt 
atatctcatc 
aaggatgtgg 
ctcagtgaaa 
ccactgttga 
cttcttgaag 
tggctttttg 
aaatggaaag 
gaccggagag 
gttctaggag 
attgagccat 
gaagatgaac 
cggcagatca 
atgagtctcc 
gaagctctat 
cttaccttct 
gcctagtgta 
ttttctttca 
gtatggagac 
ttgaactgaa 
taactttttg 
ttaaccaaat 
wmaamwrwww 
ttccattgac 
gttagatatt 
aa 



gagcatatta 
tcttcgcctt 
tggtggagtt 
ttattggagc 
aagcaaaagc 
taggaacatt 
aaatatttaa 
ttgccatctt 
ttgcagatcc 
tggtttacgt 
acgttaatga 
actttgattt 
aggtaatgag 
ctctttcttc 
aggacaacat 
ctcttcgtgg 
tgaatgttat 
tttcagtatt 
acaacatgtg 
actggtgggc 
acaaaaaacc 
tgcagcattc 
gtcttgttat 
tcacaaaaaa 
ccatgaaaga 
tgaaggcatt 
tgcaggctgc 
attggttgca 
ttgacataca 
ttcaaggaca 
tagtaagtgt 
aaccaattac 
aaaaagaatc 
ctagtacaca 
atgatggagg 
actgtaagat 
cgatggctac 
atatcaatac 
acagacttca 
catggcgaat 
ggttaaatga 
cagttgcacg 
ctaaggacct 
attcataaat 
gtaactgaaa 
cgatttcatt 
ttcggtttta 
gtttgtgaga 
attgatgaaa 
ttcatccata 
wwawwwmwar 
catttagtgt 
aaaggtgaga 



tttaatatct 
tgccaacgct 
gcttaaggaa 
cattacagtc 
tcagattgtt 
tatcccactg 
tgagaacttt 
ttttcctgct 
tcagtcagcc 
aggaattgca 
cactatcgta 
ttcatcttgt 
tatggtgtca 
agcattagca 
ctacccagct 
ctacatctta 
tgcaccaatt 
ccatgcatca 
gatatcactt 
tgcattgcta 
agatgtgaat 
aattcgtctt 
gacaggtgct 
tgttggtttg 
gatgtccatc 
ttatgctcca 
tggtcttggt 
agcagatatg 
atatggagta 
agaagaatta 
ggaatatagt 
acacaaagtt 
caaaggccct 
gtttcagaaa 
tttgacctta 
cagagtattc 
tttgcttagc 
caaaccaaag 
tgaagatgat 
aacagataat 
gttattaaag 
aaaaggtgct 
accaccaatc 
gttctataca 
tcttcaatga 
aatttgaaag. 
gtcaattcca 
gtagttttcc 
gaagtacaaa 
tatcctcttt 
wm wm w wmm am 
tgagtactgt 
attagggcag 



agaagtctag 
gttgcagttg 
cattccatac 
gtgattcttt 
cttttggtga 
gagagcaaga 
gggcccgatt 
gcaactggta 
atacccaaag 
gtatctgtag 
acagagctaa 
gaaagcagtc 
ggatttacac 
tccctagtga 
ttccagatgt 
acattcttaa 
atctcaaact 
cttgcaaaat 
cttggagcaa 
acatatgtga 
tggggatcct 
tctggagtgg 
ccaaactcac 
atgatctgtg 
gatcaagcca 
gtacatgcag 
cgtatgaagc 
agggatgtgg 
gtggttattc 
ttgtcatcac 
aaaaagtccg 
gaggaagagg 
attgtgcctt 
aaacaaggaa 
ttgatacctt 
attggtggaa 
aagttccgga 
aaagaaaata 
aaagagcaag 
gagcttgaac 
gaacattcaa 
gtgtctagtg 
ctcctagttc 
gtggacagcc 
cacattaaca 
cacacaggaa 
tatctcaatc 
tttgctactt 
aagcctttag 
tataaactta 
wwwaaaamaa 
atgtgttttg 
gttaatcaaa 



ggccagaatt 
ctatgtatgt 
ttatgataga 
taggtatctc 
tcctacttct 
agccaaaagg 
ttcgagagga 
ttctggctgg 
gaacactcct 
gttcttgtgt 
caaactgtac 
cttgttccta 
cactaatttc 
gtgctcccaa 
ttgctaaagg 
ttgcacttgg 
tcttccttgc 
ctccaggatg 
ttctttgttg 
tagtccttgg 
ctacacaagc 
aagaccacgt 
gtccagcttt 
gccatgtaca 
aatatcagcg 
atgacttgag 
caaacacact 
atatgtatat 
gcctaaaaga 
aagagaaatc 
atttagatac 
atggcaagac 
taaatgtagc 
agaatactat 
accttctgac 
agataaacag 
tagacttttc 
ttatagcttt 
atattgcaga 
tttataagac 
gcacagctaa 
ctctctacat 
gtgggaatca 
ctccagaatg 
tcacaatggc 
agttgctcca 
ttaatggtga 
gaatagcaat 
ccttgaggtg 
tagaatgtca 
aawraamact 
ttaattctat 
aatggggaag 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4392 



<210> 74 
<211> 1212 
<212> PRT 

<213> Homo sapiens 
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<400> 74 



Met u±u 


Pro 


Arg 


Pro 


inr 


Ala 

Ala 


rro 


ber 


Ser 


biy 


Aia 


Pro 


Gly 


Leu 


aia 
Ala 


1 






c 
D 










1 A 
lU 








15 




biy vai 


ijiy 


rill 

ulU 


Tnr 


rro 


ber 


A 1 sa 

Ala 


Aia 


Aia 


Leu 


ai a 

Ala 


Ala 


A 1 » 

Ala 


Arg 


\7^ 1 

Val 






^ U 










23 










A 

30 






Glu Leu 


Pro 


bly 


inr 


Ala 


Vai 


Pro 


ber 


V ai 


Pro 


blU 


Asp 


Ala 


Ala 


Pro 














/I A 










A c 








Ala Ser 


Arg 


Asp 


Gly 


Gly 


Gly 


Vai 


Arg 


Asp 


blU 


biy 


rj ^ 

rro 


Ala 


Ala 


A 1 ^ 

Ala 












c c 
55 










oU 










Gly Asp 


oiy 


Leu 


biy 


Arg 


Pro 


Leu 


oiy 


rro 


1 nr 


rro 


Ser 


bin 


C A V 

Ser 


Arg 


c c 
DO 








/U 










~I c 

/5 










80 


rne bin 


vai 


Asp 


Leu 


Vai 


Ser 


Glu 


Asn 


A 1 ^ 

Ala 


biy 


Arg 


Aia 


A 1 -1 

Ala 


Ala 


TV 1 • 

Ala 








85 










OA 

90 










A C 

95 




Ala Hia 


Aia 


Ala 


Aia 


Ala 


Ala 


Ala 


Aia 


Aia 


Aia 


biy 


Aia 


biy 


Ala 


biy 






1 UU 










T A C 

lUo 










1 1 A 

1 lu 






Ala Lys 


bin 


1 nr 


Pro 


Ala 


Asp 


biy 


blU 


Ala 


ber 


biy 


blU 


ber 


blU 


Pro 




llD 










120 










125 








Ala Lys 


Gly 


Ser 


Glu 


Glu 


Ala 


Lys 


Gly 


Arg 


Pne 


Arg 


Val 


Asn 


Phe 


Val 


130 










135 










1 il A 

140 










Asp Pro 


Ala 


Ala 


Ser 


ber 


Ser 


A 1 -1 

Ala 


blU 


Asp 


Ser 


Leu 


Ser 


Asp 


Ala 


Ala 


14 5 








150 










155 










160 


biy vai 


biy 


vai 


Asp 


Gly 


Pro 


Asn 


val 


ber 


Pne 


Gin 


Asn 


Gly 


Gly 


Asp 








165 










170 










175 




Tnr Vai 


Leu 


Ser 


Glu 


Gly 


Ser 


Ser 


Leu 


His 


Ser 


Gly 


Gly 


Gly 


Gly 


Gly 






180 










185 










190 






Ser Gly 


His 


His 


Gin 


His 


Tyr 


Tyr 


Tyr 


Asp 


Thr 


His 


Thr 


Asn 


Thr 


Tyr 




195 










200 










205 








Tyr Leu 


Arg 


rp u, „ 

Tnr 


Pne 


Gly 


His 


Asn 


Thr 


Met 


Asp 


Ala 


Val 


Pro 


Arg 


He 


2].0 










215 










220 










Asp His 


Tyr 


Arg 


His 


Thr 


Ala 


Ala 


Gin 


Leu 


Gly 


Glu 


Lys 


Leu 


Leu 


Arg 


225 








230 










235 










240 


Pro Ser 


Leu 


Ala 


blU 


Leu 


nlS 


Ann 

Asp 


blU 


Leu 


blU 


Lys 


Glu 


Pro 


Pne 


Glu 








245 










250 










255 




Asp Gly 


Pne 


Ala 


Asn 


Gly 


Glu 


Glu 


Ser 


Thr 


Pro 


Thr 


Arg 


Asp 


Ala 


Val 






2 50 










265 










270 






Vai inr 


Tyr 


rnr 


Ala 


blu 


Ser 


Lys 


Gly 


Val 


Vai 


Lys 


ITU _ 

Pne 


Gly 


Trp 


He 




O T C 

2 /5 










O O A 

280 










285 








Lys Gly 


vai 


T mi 

lieu 


vai 


Arg 


Cys 


Met 


Leu 


A MM 

Asn 


lie 


Trp 


biy 


Val 


Met 


Leu 


290 










one 

295 










300 










fne lie 


Arg 


Leu 


Ser 


Trp 


He 


Vai 


Gly 


Gin 


Ala 


Gly 


He 


Gly 


Leu 


Ser 


n c 








olO 










315 










O A A 

320 


1 T oil 

vai lieu 


vai 


T 1 *!» 

lie 


Met 


Met 


A 1 

Ala 


Inr 


XT-, 1 

Val 


Val 


1 nr 


rpL _ 

rnr 


He 


Tnr 


Gly 


Leu 


















O O A 










335 




ber inr 


Ser 


Ala 


lie 


A 1 -^i 

Ala 


1 nr 


Asn 


Gly 


rne 


vai 


A 

Arg 


biy 


biy 


biy 


Ala 
















34 5 










•3 C A 

350 






Tyr Tyr 


Leu 


lie 


ber 


Arg 


Ser 


Leu 


Gly 


Pro 


blU 


rne 


Gly 


Gly 


Ala 


He 
























ODD 








Gly Leu 


lie 


rne 


Ala 


Fne 


A 1 -a 

Ala 


A MM 

Asn 


Ala 


vai 


A 1 -i 

Ala 


Val 


Ala 


Kit 4- 

Met 


Tyr 


Val 


'in f\ 
J /U 










J /D 






















Phe 


Ala 


Glu 


Thr 


Vai 
V eiJL 


Val 

V a X 


Glu 




Leu 










T 1 


385 








390 










395 










400 


Leu Met 


He 


Asp 


Glu 


He 


Asn 


Asp 


He 


Arg 


He 


He 


Gly 


Ala 


He 


Thr 








405 










410 










415 




Vai Vai 


He 


Leu 


Leu 


Gly 


He 


Ser 


Val 


Ala 


Gly 


Met 


Glu 


Trp 


Glu 


Ala 






420 










425 










430 






Lys Ala 


Gin 


He 


Vai 


Leu 


Leu 


Val 


He 


Leu 


Leu 


Leu 


Ala 


He 


Gly 


Asp 




4 35 










440 










445 






Phe Vai 


He 


Gly 


Thr 


Phe 


He 


Pro 


Leu 


Glu 


Ser 


Lys 


Lys 


Pro 


Lys 


Gly 
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450 455 



Phe 


Phe 


Gly 


Tyr 


Lys 


Ser 


Glu 


He 


465 










4 70 






Phe Arg 


Glu 


Glu 


Glu 


Thr 


Phe 


Pne 










485 








Ala. 


AX a 


Tnr 


Gly 


He 


Leu 


Ala 


Gly 








C A A 










Asp 


Pro 


Gin 


Ser 


TV T _ 

Ala 


He 


Pro 


Lys 






jIO 










c; o n 


inr 


inr 


Leu 


Val 


Tyr 


Val 


Gly 


lie 














535 




va J. 


Arg 


Asp 


TV 1 ~ 

Ala 


Thr 


Gly Asn 


Val 


04 O 










550 






THr- 

i nj: 


Asn 


Cys 


Thr 


Ser 


Ala 


Ala 


Cys 










300 








Cys 


blU 


Ser 


Ser 


Pro 


Cys 


Ser 


Tyr 








580 










Met 


Ser 


Met 


Val 


Ser 


Gly 


Phe 


Thr 






595 










600 


Ser 


Ala 


Thr 


Leu 


Ser 


Ser 


Ala 


Leu 




blU 










615 




ixe 


rne 


Gin 


Ala 


Leu 


Cys 


Lys 


Asp 


^ o c 

625 










630 






rne 


TV 1 

Ala 


Lys 


Gly Tyr 


Gly Lys 


Asn 










645 








Leu 


inr 


Fit* ~ 

Phe 


Leu 


He 


Ala 


Leu 


Gly 








660 










vax 


lie 


TV 1 

Ala 


Pro 


He 


He 


Ser 


Asn 






b ZD 










680 


lie 


Asn 


Phe 


Ser 


Val 


Phe 


His 


Ala 




690 










695 




Arg 


Pro 


Ala 


Phe 


Lys 


Tyr 


Tyr 


Asn 












710 






lie 


Leu 


Cys 


Cys 


He 


Val 


Met 


Phe 










725 








Leu 


Tnr 


Tyr 


Val 


He 


Val 


Leu 


Gly 








740 










Lys 


Pro 


TV i^wik 

ASp 


Val 


Asn 


Trp Gly 


Ser 






n c c 

755 










760 


Asn 


A±a 


Leu 


Gin 


His 


Ser 


He 


Arg 




inn 










775 




Lys 


Asn 


rne 


Arg 


Pro 


Gin 


Cys 


Leu 


T Q c; 










790 






Arg 


Pro 


Ala 


Leu 


Leu 


His 


Leu 


Val 










805 








Xjc u 




T 1 « 

lie 


Cys 


Gly 


His 


Val 


His 








820 










Lys 


olU 


Me u 


Ser 


He 


Asp 


Gin 


Ala 


















Asn 


Lys 


Met 


Lys Ala 


Phe 


Tyr 


Ala 




850 










855 




Glu 


Gly 


Ala 


Gin 


Tyr 


Leu 


Met 


Gin 


865 










870 






Pro 


Asn 


Thr 


Leu 


Val 


Leu Gly 


Phe 










885 








Met 


Arg 


Asp 


Val 


Asp 


Met 


Tyr 


He 








900 










He 


Gin 


Tyr 


Gly Val 


Val 


Val 


He 






915 










920 



460 



Phe 


Asn 


Glu 


Asn 


Phe 


Gly 


Pro 


Asp 






A 1 C 

4 lb 










480 


Ser 


Val 


Phe 


Ala 


He 


Phe 


Phe 


Pro 




A orv 










4 95 




Ala 


Asn 


He 


Ser 


Gly 


Asp Leu Ala 


c n c 










510 






Gly 


Thr 


Leu 


Leu 


Ala 


He 


Leu 


He 










525 








TV 1 

Ala 


Val 


Ser 


Val 


Gly 


Ser 


Cys 


Val 








54 0 










Asn 


Asp 


Thr 


He 


Val 


Thr 


Glu 


Leu 






c c c 

555 










560 


Lys 


Leu 


Asn 


Phe 


Asp 


Phe 


Ser 


Ser 




c "J rv 

570 










575 




Gly 


Leu 


Met 


Asn 


Asn 


Phe 


Gin 


Val 


Q c: 










590 






Pro 


Leu 


He 


Ser 


Ala 


Gly 


He 


Phe 










605 








Ala 


Ser 


Leu 


Val 


Ser 


Ala 


Pro 


Lys 








620 








Asn 


He 


Tyr 


Pro 


Ala 


Phe 


Gin 


Met 






635 










640 


Asn 


Glu 


Pro 


Leu 


Arg 


Gly Tyr 


He 




550 










655 




Phe 


He 


Leu 


He 


Ala 


Glu 


Leu 


Asn 


ODD 










670 






Phe 


Phe 


Leu 


Ala 


Ser 


Tyr Ala 


Leu 










685 








Ser 


Leu 


Ala 


Lys 


Ser 


Pro Gly Trp 








700 










Met 


Trp 


He 


Ser 


Leu 


Leu 


Gly Ala 






715 










720 


Val 


He 


Asn 


Trp 


Trp 


Ala 


Ala 


Leu 




730 










735 




Leu 


Tyr 


He 


Tyr 


Val 


Thr 


Tyr 


Lys 


I HO 










750 






Ser 


Thr 


Gin 


Ala 


Leu 


Thr Tyr Leu 










765 








Leu 


Ser 


Gly 


Val 


Glu 


Asp 


His 


Val 








"7 0 rv 
/oU 










Val 


Met 


Thr 


Gly 


Ala 


Pro 


Asn 


Ser 






/ 










800 


nXS 


Asp 


Til- _ 

Phe 


Thr 


Lys 


Asn 


Val 


Gly 




Pin 










815 




Met 


Gly 


Pro 


Arg 


Arg 


Gin 


Ala 


Met 












830 






Lys 


Tyr 


Gin 


Arg 


Trp 


Leu 


He 


Lys 










84 5 








Pro 


Val 


His 


Ala 


Asp 


Asp 


Leu 


Arg 








OCA 

odO 










Ala 


Ala 


Gly 


Leu 


Gly 


Arg 


Met 


Lys 






875 










880 


Lys 


Lys 


Asp 


Trp 


Leu 


Gin 


Ala 


Asp 




890 










895 




Asn 


Leu 


Phe 


His 


Asp 


Ala 


Phe 


Asp 


905 










910 




Arg 


Leu 


Lys 


Glu 


Gly 


Leu Asp 


He 










925 
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Ser 


His Leu 


Gin 


Gly 


Gin 


Glu 


Glu 


Leu 


Leu 


Ser 


Ser 


Gin 


Glu 


Lys 


Ser 




930 
















940 










Pro Gly Thr 


Lys 




Val 


Val 


VaL 


Ser 


Val 


Glu 


Tvr 


Ser 


Lvs 


Lys 


Ser 




















955 










960 


Asp 


Leu Asp 


Thr 


J. 


xj jf o 


Pro 


Leu 


Ser 


Glu 
970 


Lys 


Pro 


He 


Thr 


His 
975 


Lys 


Val 


Glu Glu 


Glu 

980 


Asp 


Glv 


Lys 


Thr 


Ala 

985 


Thr 


Gin 


Pro 


Leu 


Leu 
990 


Lvs 


Lvs 


Glu 


Ser Lys 


Gly 


Pro 


He 


Val 


Pro 


Leu 


Asn 


Val 


Ala 


Asp 


Gin 


Lvs 


Leu 




995 










1000 








1005 






Leu 


Glu Ala 


Ser 


Thr 


Gin 


Phe 


Gin 


Lys 


Lys 


Gin 


Gly 


Lys 


Asn 


Thr 


He 




1010 








1015 








1020 








Asp 


Val Trp 


Trp 


Leu 


Phe 


Asp Asp 


Gly 


Gly 


Leu 


Thr 


Leu 


Leu 


He 


Pro 


1025 






1030 








1035 








1040 


Tyr 


Leu Leu 


Thr 


Thr 


Lys 


Lys 


Lys 


Trp 


Lys 


Asp 


Cys 


Lys 


He 


Arg 


Val 








1045 








1050 








1055 


Phe 


He Gly 


Gly 


Lys 


He 


Asn 


Arg 


He 


Asp 


His 


Asp 


Arg 


Arg 


Ala 


Met 






1060 








1065 








1070 




Ala 


Thr Leu 


Leu 


Ser 


Lys 


Phe 


Arg 


He 


Asp 


Phe 


Ser 


Asp 


He 


Met 


Val 




1075 








1080 








1085 






Leu 


Gly Asp 


He 


Asn 


Thr 


Lys 


Pro 


Lys 


Lys 


Glu 


Asn 


He 


He 


Ala 


Phe 




1090 








1095 








1100 








Glu 


Glu He 


He 


Glu 


Pro 


Tyr 


Arg 


Leu 


His 


Glu 


Asp Asp 


Lys 


Glu 


Gin 


1105 






1110 








1115 








1120 


Asp 


He Ala 


Asp 


Lys 


Met 


Lys 


Glu 


Asp 


Glu 


Pro 


Trp Arg 


He 


Thr 


Asp 








1125 








1130 








1135 


Asn 


Glu Leu 


Glu 


Leu 


Tyr 


Lys 


Thr 


Lys 


Thr 


Tyr 


Arg 


Gin 


He 


Arg 


Leu 






1140 








1145 








1150 




Asn 


Glu Leu 


Leu 


Lys 


Glu 


His 


Ser 


Ser 


Thr 


Ala 


Asn 


He 


He 


Val 


Met 




1155 








1160 








1165 






Ser 


Leu Pro 


Val 


Ala 


Arg 


Lys 


Gly Ala 


Val 


Ser 


Ser 


Ala 


Leu 


Tyr 


Met 




1170 








1175 








1180 








Ala 


Trp Leu 


Glu 


Ala 


Leu 


Ser 


Lys 


Asp 


Leu 


Pro 


Pro 


He 


Leu 


Leu 


Val 


1185 






1190 








1195 








1200 


Arg Gly Asn His 


Gin 


Ser 


Val 


Leu 


Thr 


Phe 


Tyr 


Ser 











1205 1210 



<210> 75 
<211> 2778 

<212> DNA 

<213> Homo sapiens 

<400> 75 

gtcgacccac gcgtccggca agaagctgac gggtcgcctc atgctggccg tgggaggagc 60 
agtgcttggc tccctgcagt ttggctacaa cactggagtc atcaatgccc cccagaaggt 120 
gatcgaggag ttctacaacc agacatgggt ccaccgctat ggggagagca tcctgcccac 180 
cacgctcacc acgctctggt ccctctcagt ggccatcttt tctgttgggg gcatgattgg 240 
ctccttctct gtgggccttt tcgttaaccg ctttggccgg cggaattcaa tgctgatgat 300 
gaacctgctg gccttcgtgt ccgccgtgct catgggcttc tcgaaactgg gcaagtcctt 360 
tgagatgctg atcctgggcc gcttcatcat cggtgtgtac tgcggcctga ccacaggctt 420 
cgtgcccatg tatgtgggtg aagtgtcacc cacagccctt cgtggggccc tgggcaccct 480 
gcaccagctg ggcatcgtcg tcggcatcct catcgcccag gtgttcggcc tggactccat 540 
catgggcaac aaggacctgt ggcccctgct gctgagcatc atcttcatcc cggccctgct 600 
gcagtgcatc gtgctgccct tctgccccga gagtccccgc ttcctgctca tcaaccgcaa 660 
cgaggagaac cgggccaaga gtgtgctaaa gaagctgcgc gggacagctg acgtgaccca 720 
tgacctgcag gagatgaagg aagagagtcg gcagatgatg cgggagaaga aggtcaccat 780 
cctggagctg ttccgctccc ccgcctaccg ccagcccatc ctcatcgctg tggtgctgca 84 0 
gctgtcccag cagctgtctg gcatcaacgc tgtcttctat tactccacga gcatcttcga 900 
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gaaggcgggg gtgcagcagc ctgtgtatgc caccattggc tccggtatcg tcaacacggc 960 

cttcactgtc gtgtcgctgt ttgtggtgga gcgagcaggc cggcggaccc tgcacctcat 1020 

aggcctcgct ggcatggcgg gttgtgccat actcatgacc atcgcgctag cactgctgga 1080 

gcagctaccc tggatgtcct atctgagcat cgtggccatc tttggctttg tggccttctt 1140 

tgaagtgggt cctggcccca tcccatggtt catcgtggct gaactcttca gccagggtcc 1200 

acgtccagct gccattgccg ttgcaggctt ctccaactgg acctcaaatt tcattgtggg 1260 

catgtgcttc cagtatgtgg agcaactgtg tggtccctac gtcttcatca tcttcactgt 1320 

gctcctggtt ctgttcttca tcttcaccta cttcaaagtt cctgagacta aaggccggac 1380 

cttcgatgag atcgcttccg gcttccggca ggggggagcc agccaaagtg acaagacacc 1440 

cgaggagctg ttccatcccc tgggggctga ttcccaagtg tgagtcgccc cagatcacca 1500 

gcccggcctg ctcccagcag ccctaaggat ctctcaggag cacaggcagc tggatgagac 1560 

ttccaaacct gacagatgtc agccgagccg ggcctggggc tcctttctcc agccagcaat 1620 

gatgtccaga agaatattca ggacttaacg gctccaggat tttaacaaaa gcaagactgt 1680 

tgctcaaatc tattcagaca agcaacaggt tttataattt ttttattact gattttgtta 1740 

tttttatatc agcctgagtc tcctgtgccc acatcccagg cttcaccctg aatggttcca 1800 

tgcctgaggg tggagactaa gccctgtcga gacacttgcc ttcttcaccc agctaatctg 1860 

tagggctgga cctatgtcct aaggacacac taatcgaact atgaactaca aagcttctat 1920 

cccaggaggt ggctatggcc acccgttctg ctggcctgga tctccccact ctaggggtca 1980 

ggctccatta ggatttgccc cttcccatct cttcctaccc aaccactcaa attaatcttt 2040 

ctttacctga gaccagttgg gagcactgga gtgcagggag gagaggggaa gggccagtct 2100 

gggctgccgg gttctagtct cctttgcact gagggccaca ctattaccat gagaagaggg 2160 

cctgtgggag cctgcaaact cactgctcaa gaagacatgg agactcctgc cctgttgtgt 2220 

atagatgcaa gatatttata tatatttttg gttgtcaata ttaaatacag acactaagtt 2280 

atagtatatc tggacaagcc aacttgtaaa tacaccacct cactcctgtt acttacctaa 2340 

acagatataa atggctggtt tttagaaaca tggttttgaa atgcttgtgg attgagggta 2400 

ggaggtttgg atgggagtga gacagaagta agtggggttg caaccactgc aacggcttag 2460 

acttcgactc aggatccagt cccttacacg tacctctcat cagtgtcctc ttgctcaaaa 2520 

atctgtttga tccctgttac ccagagaata tatacattct ttatcttgac attcaaggca 2580 

tttctatcac atatttgata gttggtgttc aaaaaaacac tagttttgtg ccagccgtga 2640 

tgctcaggct tgaaatgcat tattttgaat gtgaagtaaa tactgtacct ttattggaca 2700 

ggctcaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2760 
aaaaaaaagg gcggccgc 2778 

<210> 76 
<211> 480 
<212> PRT 

<213> Homo sapiens 



<400> 76 



Met 


Leu 


Ala 


Val 


Gly 


Gly 


Ala 


Val 


1 








5 








Asn 


Thr 


Gly 


Val 


He 


Asn 


Ala 


Pro 








20 










Asn 


Gin 


Thr 


Trp 


Val 


His 


Arg 


Tyr 






35 










40 


Leu 


Thr 


Thr 


Leu 


Trp 


Ser 


Leu 


Ser 




50 










55 




Met 


He 


Gly 


Ser 


Phe 


Ser 


Val 


Gly 


65 










70 






Arg 


Asn 


Ser 


Met 


Leu 


Met 


Met 


Asn 










85 








Leu 


Met 


Gly 


Phe 


Ser 


Lys 


Leu 


Gly 








100 










Gly 


Arg 


Phe 


He 


He 


Gly 


Val 


Tyr 






115 










120 


Pro 


Met 


Tyr 


Val 


Gly 


Glu 


Val 


Ser 




130 










135 




Gly 


Thr 


Leu 


His 


Gin 


Leu 


Gly 


He 



145 150 



Leu 


Gly 


Ser 


Leu 


Gin 


Phe 


Gly 


Tyr 




10 










15 




Gin 


Lys 


Val 


He 


Glu 


Glu 


Phe 


Tyr 


25 










30 






Gly 


Glu 


Ser 


He 


Leu 


Pro 


Thr 


Thr 










45 








Val 


Ala 


He 


Phe 


Ser 


Val 


Gly 


Gly 








60 










Leu 


Phe. 


Val 


Asn 


Arg 


Phe 


Gly 


Arg 






75 










80 


Leu 


Leu 


Ala 


Phe 


Val 


Ser 


Ala 


Val 




90 










95 




Lys 


Ser 


Phe 


Glu 


Met 


Leu 


He 


Leu 


105 










110 






Cys 


Gly 


Leu 


Thr 


Thr 


Gly 


Phe 


Val 










125 








Pro 


Thr 


Ala 


Leu 


Arg 


Gly 


Ala 


Leu 








140 










Val 


Val 


Gly 


He 


Leu 


He 


Ala 


Gin 



155 160 
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VaX tins 


Gly Leu 






T 1 tf> 


nc u 


oxy 


Asn 


Lys 


Asp 


Leu 


Trp 


Pro 


Leu 








1 OD 










1 /U 








175 




T 1 T.011 


Ser 


lie 


T 1 o 


It llt= 


Tie 


U L, \J 




Leu 


Leu 


bin 


Cys 


He 


Vai 


Leu 






1 on 










1 0 D 










190 






iTiO tiie 


Cys 


rro 


UJL U 




Pv^ 


Arg 


Pho 

It ne 


Leu 


Leu 


T 1 A 

lie 


Asn 


Arg 


Asn 


Glu 














9nn 










9 A 








Arg 


AX a 


Lys 


oer 


V ai 


Leu 


bys 


Lys 


Leu 


Arg 


bly 


Thr 


Ala 


Asp 


9 1 n 

iC 1 u 










91 R 
Z J. □ 










9 9n 










nlS 


Asp 


Leu 


f^l n 

b±n 






Lys 


o±U 


rill 

ulU 


Ca V 

oer 


Arg 


Gin 


Met 


Met 




















9 "3 R 










240 




Lys 


Lys 


V a ± 


i ili 


T 1 <a 


Leu 


bl u 


Leu 


PKa 

rne 


Arg 


Ser 


Pro 


Aia 


Tyr 




























255 


TV /-I 

ArQ Gin 


Pro 


lie 


Leu 


lie 




vai 


\73 1 

vai 


Leu 


bin 


Leu 


ber 


Gin 


Gin 


Leu 






9 










9 CL 










O "J 1^ 

270 








He 


IV n M 

Asn 




Vox 




lyr 


Tyr 


Ser 


1 nr 


oer 


T 1 A 

lie 


fne 


blU 


hys 




O "1 c 










9Q n 










o o c 








Ala biy 


vai 


bin 


Liin 


Pro 


vai 


Tyr 


Ala 


i nr 


lie 


bly 


Ser 


Gly 


He 


Vai 


^: y u 




















"3 n n 










Asn inr 


TV T — 

Ala 


Phe 


I nr 


vai 


\7a 1 

vai 


Ser 


Leu 


Pne 


Vai 


Vai 


Glu 


Arg 


Ala 


Gly 










olU 










j15 








320 


TV TV M 


Thr 


Leu 


HIS 


Leu 


lie 


biy 


Leu 


TV 1 -* 

Ala 


Gly 


Met 


Ala 


Gly 


Cys 


Ala 


















"5 O A 










335 




1X6 L6U 


Met 


Thr 


T 1 ta 

lie 


Aia 


Leu 


Aia 


Leu 


Leu 


nil 
blU 


bin 


Leu 


Pro 


Trp 


Met 






340 










OH 0 
















OCX 1 yr 


Leu 


Ser 


lie 


vai 


Aia 


1 le 


Fne 


biy 


rne 


vai 


Ala 


rne 


Pne 


Glu 




355 




















o ^ c 
ODD 








vai biy 


Pro 


Gly 


Pro 


lie 


Pro 


Trp 


Fne 


1 le 


Vai 


Ala 


Glu 


Leu 


Phe 


Ser 


J / u 










"5 7 ^ 










"3 o r\ 










\3±ii oi.y 


Pro Arg 


Pro 


Ala 


Aia 


lie 


Ala 


vai 


Ala 


bly 


rne 


Ser 


Asn 


Trp 


Q t; 




























400 




Asn 


Phe 


lie 


vai 


vaiy 


Met 


Cys 


rne 


bin 


Tyr 


\7-» 1 

vai 


blU 


Gin 


Leu 


















4 10 










415 




Cys Gly 


Pro 


Tyr 


vai 


rne 


lie 


lie 


rne 


rnr 


vai 


Leu 


Leu 


Vai 


Leu 


Phe 






420 




















4 30 






Phe He 


Phe 


Thr 


Tyr 


Phe 


Lys 


Vai 


Pro 


Glu 


Thr 


Lys 


Gly 


Arg 


Thr 


Phe 




435 










440 










445 








Asp Glu 


He 


Ala 


Ser 


Gly 


Phe 


Arg 


Gin 


Gly 


Gly 


Ala 


Ser 


Gin 


Ser 


Asp 


450 










455 










460 










Lys Thr 


Pro 


Glu 


Glu 


Leu 


Phe 


His 


Pro 


Leu 


Gly 


Ala 


Asp 


Ser 


Gin 


Vai 


4 65 








470 










475 










480 



<210> 77 
<211> 2473 
<212> DNA 

<213> Homo sapiens 
<400> 77 

gtcgacccac gcgtccgcgc gaggcgcggg gagcctggga ccaggagcga gagccgccta 60 

cctgcagccg ccgcccacgg cacggcagcc accatggcgc tcctgctgtg cttcgtgctc 120 

ctgtgcggag tagtggattt cgccagaagt ttgagtatca ctactcctga agagatgatt 180 

gaaaaagcca aaggggaaac tgcctatctg ccatgcaaat ttacgcttag tcccgaagac 240 

cagggaccgc tggacatcga gtggctgata tcaccagctg ataatcagaa ggtggatcaa 300 

gtgattattt tatattctgg agacaaaatt tatgatgact actatccaga tctgaaaggc 360 

cgagtacatt ttacgagtaa tgatctcaaa tctggtgatg catcaataaa tgtaacgaat 420 

ttacaactgt cagatattgg cacatatcag tgcaaagtga aaaaagctcc tggtgttgca 480 

aataagaaga ttcatctggt agttcttgtt aagccttcag gtgcgagatg ttacgttgat 54 0 

ggatctgaag aaattggaag tgactttaag ataaaatgtg aaccaaaaga aggttcactt 600 

ccattacagt atgagtggca aaaattgtct gactcacaga aaatgcccac ttcatggtta 660 
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gcagaaatga cttcatctgt 
tacagctgta cagtcagaaa 
gtccctcctt caaataaagc 
ctagcgctca ttggtcttat 
gaaaaggaag ttcatcacga 
actgccagaa gctacatcgg 
atggaaggat attccaagac 
cctcagagtc cgactctccc 
gcgattcctg tgatgattcc 
atgtctcatc tgtgctctcc 
gtctaaattg tgttactagc 
ggaatatatt tttaaaaatt 
agtaaagaaa agtttaccat 
ttttctaaat ttatcagtac 
aagacatggg tgaacttact 
tacttgatca tttttaccat 
gtctccccca aattagtaca 
ggttttacag ctcctttgaa 
tacagtggtt tagaaattac 
atgacttctt aaatatttag 
tttatatgct aaaggagcat 
tgagatgaca ctaggtgcaa 
gagatctgtg gtggtcttca 
ttcccaggtg atatttttct 
atagattgaa atttcctaat 
atgttgttgt tgtttttgga 
aatttaccct cttgaatata 
aaatacattt taagcaagtt 
tgtacagttg ctgttgtgtg 
gctttgaaaa agtttgtctt 
aaagggcggc cgc 

<210> 78 
<211> 365 
<212> PRT 

<213> Homo sapiens 



<400> 78 



Met 


Ala 


Leu 


Leu 


Leu 


Cys 


Phe 


Val 


Leu 


Leu 


Cys 


Gly 


Val 


Val 


Asp 


Phe 


1 








5 










10 










15 




Ala Arg Ser 


Leu 


Ser 


He 


Thr 


Thr 


Pro 


Glu 


Glu 


Met 


He 


Glu 


Lys 


Ala 








20 










25 










30 






Lys 


Gly. Glu 


Thr 


Ala 


Tyr 


Leu 


Pro 


Cys 


Lys 


Phe 


Thr 


Leu 


Ser 


Pro 


Glu 






35 










40 










45 








Asp Gin 


Gly 


Pro 


Leu 


Asp 


He 


Glu 


Trp 


Leu 


He 


Ser 


Pro 


Ala 


Asp 


Asn 




50 










55 










60 










Gin 


Lys 


Val 


Asp 


Gin 


Val 


He 


He 


Leu 


Tyr 


Ser 


Gly 


Asp 


Lys 


He 


Tyr 


65 










70 










75 










80 


Asp 


Asp 


Tyr 


Tyr 


Pro 
85 


Asp 


Leu 


Lys 


Gly 


Arg 
90 


Val 


His 


Phe 


Thr 


Ser 
95 


Asn 


Asp 


Leu 


Lys 


Ser 
100 


Gly 


Asp 


Ala 


Ser 


He 
105 


Asn 


Val 


Thr 


Asn 


Leu 
110 


Gin 


Leu 


Ser 


Asp 


He 
115 


Gly 


Thr 


Tyr 


Gin 


Cys 
120 


Lys 


Val 


Lys 


Lys 


Ala 

125 


Pro 


Gly 


Val 


Ala 


Asn 
130 


Lys 


Lys 


He 


His 


Leu 

135 


Val 


Val 


Leu 


Val 


Lys 
140 


Pro 


Ser 


Gly 


Ala 


Arg 


Cys 


Tyr 


Val 


Asp 


Gly 


Ser 


Glu 


Glu 


He 


Gly 


Ser 


Asp 


Phe 


Lys 


He 


145 










150 










155 










160 


Lys 


Cys 


Glu 


Pro 


Lys 


Glu 


Gly 


Ser 


Leu 


Pro 


Leu 


Gin 


Tyr 


Glu 


Trp 


Gin 



tatatctgta aaaaatgcct cttctgagta ctctgggaca 720 
cagagtgggc tctgatcagt gcctgttgcg tctaaacgtt 780 
tggactaatt gcaggagcca ttataggaac tttgcttgct 840 
catcttttgc tgtcgtaaaa agcgcagaga agaaaaatat 900 
tatcagggaa gatgtgccac ctccaaagag ccgtacgtcc 960 
cagtaatcat tcatccctgg ggtccatgtc tccttccaac 1020 
tcagtataac caagtaccaa gtgaagactt tgaacgcact 1080 
acctgctaag gtagctgccc ctaatctaag tcgaatgggt 1140 
agcacagagc aaggatgggt ctatagtata gagcctccat 1200 
gtgttccttt cctttttttg atatatgaaa acctattctg 1260 
ctcaaaatac atcaaaaaat aagttaatca ggaactgtac 1320 
tttgtttggt tatatcaaaa tagttacagg cactaaagtt 1380 
ctgaaaaagc tggattttct ttaagaggtt gattataaag 1440 
ctaagtaaga tgtagcgctt tgaatatgaa atcataggtg 1500 
tgcataccaa gttgatactt gaataaccat ctgaaagtgg 1560 
tatttttagg atgtgtattt catttattta tggcccacca 1620 
gaaatatcca tgacaaaatt acttacgtat gtttgtactt 1680 
aactctgtgt ttggaatatc tctaaaaaca tagaaaacac 1740 
taattttact tctaagtcat tcataaacct tgtctatgaa 1800 
ttgatagact gctacaggta atagggactt agcaagctct 1860 
ctatcagatt aagttagaac atttgctgtc agccacatat 1920 
tagcagggat agattttgtt ggtgagtagt ctcatgcctt 1980 
aaatggtggc cagccagatc aaggatgtag tatctcatag 2040 
tattagaaaa atattataac tcatttgttg tttgacactt 2100 
ttattctaaa ttttaagtgg ttctttggtt ccagtgcttt 2160 
tggtgttaca tattatatgt tctagaaaca tgtaatccta 2220 
atccctggat gatatttttt atcataaatg cagaataatc 2280 
aagtgtcctc catcaattct gtattccaga cttgggagga 2340 
atcaaacatg tctctgtgta gttccagcaa atcaagctga 2400 
agttttgtga aggtgattta ttcttaaaaa aaaaaaaaaa 2460 

2473 
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165 170 175 



Lys 


Leu 


ber 


Asp 

1 Q n 
loU 


Ser 


Gin 


Lys 


Met 


Pro 
185 


Thr 


Ser 


Trp 


Leu 


Ala Glu 
190 


Met 


Thr 


Ser 


Ser 


vai 


T 1 A 

lie 


Ser 


Val 


Lys 


Asn 


Ala 


Ser 


Ser 


Glu 


Tyr Ser 


Gly 






195 










200 










205 




Thr 


Tyr 


Ser 


Cys 


Tnr 


Val 


Arg Asn Arg Val Gly 


Ser 


Asp Gin Cys 


Leu 




210 




















o o rv 








Leu 


Arg 


Leu 


Asn 


Val 


Val 


Pro 


Pro 


Ser 


Asn 


Lys 


TV 1 » 

Ala 


Gly 


Leu He 


Ala 


225 










O "3 
^ JO 










O T C 

23d 








240 


Gly Ala 


T 1 A 

ixe 


lie 


Gly 


Thr 


Leu 


Leu 


Ala 


Leu 


Ala 


Leu 


He 


Gly Leu 


He 




















250 








255 




lie 


Phe 


Cys 


Cys 
260 


Arg 


Lys 


Lys 


Arg 


Arg 

o ^ c 
Z DO 


Glu 


Glu 


Lys 


Tyr 


Glu Lys 
270 


Glu 


Val 


His 


nlS 


Asp 


lie 


Arg 


Glu 


Asp 


Val 


Pro 


Pro 


Pro 


Lys 


Ser Arg 


Thr 






0*7 C 
£. to 










280 










285 




Ser 


Thr 


Ala 


Arg 


Ser 


Tyr 


lie 


Gly 


Ser 


Asn 


His 


Ser 


Ser 


Leu Gly 


Ser 




290 










295 










300 






Met 


Ser 


Pro 


Ser 


Asn 


Met 


Glu 


Gly 


Tyr 


Ser 


Lys 


Thr 


Gin 


Tyr Asn 


Gin 


305 










310 










315 








320 


Val 


Pro 


Ser 


Glu 


Asp 

325 


Phe 


Glu 


Arg 


Thr 


Pro 

330 


Gin 


Ser 


Pro 


Thr Leu 

335 


Pro 


Pro 


Ala 


Lys 


Val 


Ala 


Ala 


Pro 


Asn 


Leu 


Ser Arg 


Met 


Gly 


Ala He 


Pro 








340 










345 










350 




Val 


Met 


lie 
355 


Pro 


Ala 


Gin 


Ser 


Lys 
360 


Asp 


Gly 


Ser 


He 


Val 
365 







<210> 79 
<211> 1588 
<212> DNA 

<213> Homo sapiens 

<400> 79 

gtcgacccac gcgtccggca 
ggcaggatga actggacagg 
gccattggcc gagtatggct 
gctgcagaga gtgtgtgggg 
ggctgcaaca gcgtttgcta 
ctgcagctca tcctagtttc 
caacacatag agaagaaaat 
gaggtgaaga ggcacaaggt 
gtggtgttcC' ggctgttgtt 
ggctatgcca tggtgcggct 
tgcttcgtgt cccgccccac 
ggcatctgca tcatcctcaa 
cgccgagccc agcgccgctc 
ctctcacctg aatacaagca 
ctgaaagaca tactgcgccg 
cgctgctcgg cctgctgatg 
cctgggcgag cccctccttc 
tcgatcaaaa ccttccttcc 
gctggagggg tggggagcta 
ggctgccctt gttgcctgca 
tacaagagat gggatgctcc 
tgtcagatac cctgtttctg 
agggcagggg tgctgtggac 
aggaatagtg cttgctggcc 
gagagagaag caggcagata 
tccccaaggc ctctctctgc 



gcagcagcca ggtgtggcag 
tttgtacacc ttgctcagtg 
ctcggtcatc ttcatcttca 
tgatgagaaa tcttccttca 
tgaccaattc ttccccatct 
caccccagct ctcctcgtgg 
gctacggctt gagggccatg 
ccacatctca gggacactgt 
tgaggccgtc ttcatgtatg 
ggtcaagtgc gacgtctacc 
cgagaaaacc gtcttcaccg 
tgtggccgag gtggtgtacc 
caatccacct tcccgcaagg 
gaatgagatc aacaagctgc 
cagccctggc accggggctg 
ccacatacca ggcaacctcc 
tcccctgccg gtgcacaggc 
ctggctactt cccttcctcc 
gaggccacct atgccagtgc 
cccttccctc ttccctctcc 
gacagcgtct ccaattatga 
gagtcacatc agtgaggagg 
atgtgggtgg agaagggagg 
acaaggaaaa ggaggaggtg 
agttggagca ggggttggtc 
ctgaaatgtt acacattaaa 



tgacagggag gtgtgaatga 60 
gcgtgaaccg gcattctact 120 
gaatcatggt gctggtggtg 180 
tctgcaacac actccagcct 240 
cccatgtgcg gctgtggtcc 300 
ccatgcacgt ggctcaccag 360 
gggaccccct acacctggag 420 
ggtggaccta tgtcatcagc 480 
tcttttatct gctctaccct 540 
cctgccccaa cacagtggac 600 
tcttcatgct agctgcctct 660 
tcatcatccg ggcctgtgcc 720 
gctcgggctt cggccaccgc 780 
tgagtgagca ggatggctcc 840 
ggctggctga aaagagcgac 900 
catcccaccc ccgaccctgc 960 
ctctgcctgc tggggattac 1020 
cggggccttc cttttgagga 1080 
tcaaggttac tgggagtgtg 1140 
ctctctctgg gaccactggg 1200 
aactaatctt aaccctgtgc 1260 
gatgtgggta agaggagcag 1320 
gtggccagca ctagtaaagg 1380 
tctggggtga gggagttagg 1440 
aaggccacct ctgcctctag 1500 
caggatttta cagtaaatga 1560 
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aaaaaaaaaa aaaaaaaagg gcggccgc 1588 

<210> 80 

<211> 283 

<212> PRT 

<213> Homo sapiens 

<400> 80 

Met Asn Trp Thr Gly Leu Tyr Thr Leu Leu Ser Gly Val Asn Arg His 

15 10 15 

Ser Thr Ala lie Gly Arg Val Trp Leu Ser Val He Phe He Phe Arg 

20 25 30 

He Met Val Leu Val Val Ala Ala Glu Ser Val Trp Gly Asp Glu Lys 

35 40 45 

Ser Ser Phe He Cys Asn Thr Leu Gin Pro Gly Cys Asn Ser Val Cys 

50 ■ 55 60 

Tyr Asp Gin Phe Phe Pro He Ser His Val Arg Leu Trp Ser Leu Gin 
65 70 75 80 

Leu He Leu Val Ser Thr Pro Ala Leu Leu Val Ala Met His Val Ala 

85 90 95 

His Gin Gin His He Glu Lys Lys Met Leu Arg Leu Glu Gly His Gly 

100 105 110 

Asp Pro Leu His Leu Glu Glu Val Lys Arg His Lys Val His He Ser 

115 120 125 

Gly Thr Leu Trp Trp Thr Tyr Val He Ser Val Val Phe Arg Leu Leu 

130 135 140 

Phe Glu Ala Val Phe Met Tyr Val Phe Tyr Leu Leu Tyr Pro Gly Tyr 
145 150 155 160 

Ala Met Val Arg Leu Val Lys Cys Asp Val Tyr Pro Cys Pro Asn Thr 

165 170 175 

Val Asp Cys Phe Val Ser Arg Pro Thr Glu Lys Thr Val Phe Thr Val 

180 185 190 

Phe Met Leu Ala Ala Ser Gly He Cys He He Leu Asn Val Ala Glu 

195 200 205 

Val Val Tyr Leu He He Arg Ala Cys Ala Arg Arg Ala Gin Arg Arg 

210 215 220 

Ser Asn Pro Pro Ser Arg Lys Gly Ser Gly Phe Gly His Arg Leu Ser 
225 230 235 240 

Pro Glu Tyr Lys Gin Asn Glu He Asn Lys Leu Leu Ser Glu Gin Asp 

245 250 255 

Gly Ser Leu Lys Asp He Leu Arg Arg Ser Pro Gly Thr Gly Ala Gly 

260 265 270 

Leu Ala Glu Lys Ser Asp Arg Cys Ser Ala Cys 
275 280 



<210> 81 

<211> 3337 

<212> DNA 

<213> Homo sapiens 

<400> 81 

gtcgacccac gcgtccggag ccagctctcc cgagcccgta accttcgcat cccaagagct 60 

gcagtttcag ccgcgacagc aagaacggca gagccggcga ccgcggcggc ggcggcggcg 120 

gaggcaggag cagcctgggc gggtcgcagg gtctccgcgg gcgcaggaag gcgagcagag 180 

atatcctctg agagccaagc aaagaacatt aaggaaggaa ggaggaatga ggctggatac 240 

ggtgcagtga aaaaggcact tccaagagtg gggcactcac tacgcacaga ctcgacggtg 300 

ccatcagcat gagaacttac cgctacttct tgctgctctt ttgggtgggc cagccctacc 360 

caactctctc aactccacta tcaaagagga ctagtggttt cccagcaaag aaaagggccc 420 
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tggagctctc tggaaacagc aaaaatgagc tgaaccgttc aaaaaggagc tggatgtgga 480 
atcagttctt tctcctggag gaatacacag gatccgatta tcagtatgtg ggcaagttac 540 
attcagacca ggatagagga gatggatcac ttaaatatat cctttcagga gatggagcag 600 
gagatctctt cattattaat gaaaacacag gcgacataca ggccaccaag aggctggaca 660 
gggaagaaaa acccgtttac atccttcgag ctcaagctat aaacagaagg acagggagac 720 
ccgtggagcc cgagtctgaa ttcatcatca agatccatga catcaatgac aatgaaccaa 780 
tattcaccaa ggaggtttac acagccactg tccctgaaat gtctgatgtc ggtacatttg 840 
ttgtccaagt cactgcgacg gatgcagatg atccaacata tgggaacagt gctaaagttg 900 
tctacagtat tctacaggga cagccctatt tttcagttga atcagaaaca ggtattatca 960 
agacagcttt gctcaacatg gatcgagaaa acagggagca gtaccaagtg gtgattcaag 1020 
ccaaggatat gggcggccag atgggaggat tatctgggac caccaccgtg aacatcacac 1080 
tgactgatgt caacgacaac cctccccgat tcccccagag tacataccag tttaaaactc 1140 
ctgaatcttc tccaccgggg acaccaattg gcagaatcaa agccagcgac gctgatgtgg 1200 
gagaaaatgc tgaaattgag tacagcatca cagacggtga ggggctggat atgtttgatg 1260 
tcatcaccga ccaggaaacc caggaaggga ttataactgt caaaaagctc ttggactttg 1320 
aaaagaagaa agtgtatacc cttaaagtgg aagcctccaa tccttatgtt gagccacgat 1380 
ttctctactt ggggcctttc aaagattcag ccacggttag aattgtggtg gaggatgtag 1440 
atgagccacc tgtcttcagc aaactggcct acatcttaca aataagagaa gatgctcaga 1500 
taaacaccac aataggctcc gtcacagccc aagatccaga tgctgccagg aatcctgtca 1560 
agtactctgt agatcgacac acagatatgg acagaatatt caacattgat tctggaaatg 1620 
gttcgatttt tacatcgaaa cttcttgacc gagaaacact gctatggcac aacattacag 1680 
tgatagcaac agagatcaat aatccaaagc aaagtagtcg agtacctcta tatattaaag 1740 
ttctagatgt caatgacaac gccccagaat ttgctgagtt ctatgaaact tttgtctgtg 1800 
aaaaagcaaa ggcagatcag ttgattcaga ccctgcatgc tgttgacaag gatgaccctt 1860 
atagtggaca ccaattttcg ttttccttgg cccctgaagc agccagtggc tcaaacttta 1920 
ccattcaaga caacaaagac aacacggcgg gaatcttaac tcggaaaaat ggctataata 1980 
gacacgagat gagcacctat ctcttgcctg tggtcatttc agacaacgac tacccagttc 2040 
aaagcagcac tgggacagtg actgtccggg tctgtgcatg tgaccaccac gggaacatgc 2100 
aatcctgcca tgcggaggcg ctcatccacc ccacgggact gagcacgggg gctctggttg 2160 
ccatccttct gtgcatcgtg atcctactag tgacagtggt gctgtttgca gctctgaggc 2220 
ggcagcgaaa aaaagagcct ttgatcattt ccaaagagga catcagagat aacattgtca 2280 
gttacaacga cgaaggtggt ggagaggagg acacccaggc ttttgatatc ggcaccctga 2340 
ggaatcctga agccatagag gacaacaaat tacgaaggga cattgtgccc gaagcccttt 24 00 
tcctaccccg acggactcca acagctcgcg acaacaccga tgtcagagat ttcattaacc 24 60 
aaaggttaaa ggaaaatgac acggacccca ctgccccgcc atacgactcc ttggccactt 2520 
acgcctatga aggcactggc tccgtggcgg attccctgag ctcgctggag tcagtgacca 2580 
cggatgcaga tcaagactat gattacctta gtgactgggg acctcgattc aaaaagcttg 2640 
cagatatgta tggaggagtg gacagtgaca aagactccta atctgttgcc tttttcattt 2700 
tccaatacga cactgaaata tgtgaagtgg ctatttcttt atatttatcc actactccgt 27 60 
gaaggcttct ctgttctacc cgttccaaaa gccaatggct gcagtccgtg tggatccaat 2820 
gttagagact tttttctagt acacttttat gagcttccaa ggggcaaatt tttatttttt 2880 
agtgcatcca gttaaccaag tcagcccaac aggcaggtgc cggaggggag gacagggaac 2940 
agtatttcca cttgttctca gggcagcgtg cccgcttccg ctgtcctggt gttttactac 3000 
actccatgtc aggtcagcca actgccctaa ctgtacattt cacaggctaa tgggataaag 3060 
gactgtgctt taaagataaa aatatcatca tagtaaaaga aatgagggca tatcggctca 3120 
caaagagata aactacatag gggtgtttat ttgtgtcaca aagaatttaa aataacactt 3180 
gcccatgcta tttgttcttc aagaactttc tctgccatca actactattc aaaacctcaa 3240 
atccacccat atgttaaaat tctcattact cttaaggaat agaagcaaat taaacggtaa 3300 
catccaaaag caaaaaaaaa aaaaaaaggg cggccgc 3337 

<210> 82 
<211> 790 
<212> PRT 

<213> Homo sapiens 
<400> 82 

Met Arg Thr Tyr Arg Tyr Phe Leu Leu Leu Phe Trp Val Gly Gin Pro 

15 10 15 

Tyr Pro Thr Leu Ser Thr Pro Leu Ser Lys Arg Thr Ser Gly Phe Pro 
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Ala 


Lys 


Lys 


Arg 






35 




Asn 


Arg 




Lys 












lyr 


Thr* 
1 ilX, 


Pit/ 
biy 


C R 
DD 








bin 


Asp 


Arg 


bly 


A J. a 




Asp 


Leu 








i uu 


Thr 


Lys 


Arg 


Leu 






115 




bin 


Ala 


He 


Asn 




1 






rne 


lie 


lie 


Lys 


14 3 








Lys 


Glu 


Val 


Tyr 


Phe 


Val 


vai 


bin 








180 


Asn 


Ser 


Ala 


Lys 






195 




Ser 


Val 


Glu 


Ser 




210 






Asp Arg 


blU 


Asn 


225 








Met 


Gly 


bly 


bin 


Thr 


Leu 


Thr 


Asp 








2 60 


Tyr 


Gin 


Phe 


Lys 






275 




Arg 


He 


Lys 


Ala 




290 






Tyr 


Ser 


T 1 « 

He 


Thr 


305 








Asp Gin 


blU 


Tnr 


Phe 


Glu 


Lys 


Lys 








o4 U 


Tyr 


Val 


Glu 


Pro 






355 




Thr 


Val 


Arg 


1 le 




370 






Lys 


Leu 




Tyr 


385 








Thr 


He 


biy 


Ser 


Val 


Lys 


Tyr 


Ser 








/ion 
4 


He 


Asp 


OCX 


o J. y 






435 




Glu 


Thr 


Leu 


Leu 




450 






Asn 


Pro 


Lys 


Gin 


465 








Val 


Asn 


Asp 


Asn 



Ala 


Leu 


Glu 


Leu 








40 


Arg 


Ser 


Trp 


Met 










Ser 


Asp 


Tyr 


Gin 




/U 






Asp 


Gly 


Ser 


Leu 


85 








Phe 


He 


He 


Asn 


Asp 


Arg 


Glu 


Glu 








120 


Arg 


Arg 


Thr 


Gly 






135 




He 


His 


Asp 


He 




150 






Thr 


Ala 


Thr 


Val 


1 DO 








Val 


Thr 


Ala 


Thr 


Val 


Val 


Tyr 


Ser 








200 


Glu 


Thr 


Gly 


He 






215 




Arg 


Glu 


Gin 


Tyr 




230 






Met 


Gly Gly Leu 


245 








Val 


Asn 


Asp 


Asn 


Thr 


Pro 


Glu 


Ser 








280 


Ser 


Asp Ala Asp 






295 




Asp 


Gly Glu Gly 




310 






Gin 


Glu 


Gly 


lie 


325 








Lys 


Val 


Tyr 


Thr 


Arg 


Phe 


Leu 


Tyr 








360 


Val 


Val 


Glu 


Asp 






375 




He 


Leu 


Gin 


He 




390 






Val 


Thr 


Ala 


Gin 


4Uo 








Val 


Asp Arg 


His 


Asn 


Gly 


Ser 


He 








440 


Trp 


His 


Asn 


He 






455 




Ser 


Ser 


Arg 


Val 




470 






Ala 


Pro 


Glu 


Phe 


485 









Ser 


Gly 


Asn 


Ser 


Trp 


Asn 


Gin 


Phe 








60 


Tyr 


Val 


Gly Lys 






75 




Lys 


Tyr 


He 


Leu 




90 






Glu 


Asn 


Thr 


Gly 


105 








Lys 


Pro 


Val 


Tyr 


Arg 


Pro 


Val 


Glu 








140 


Asn 


Asp 


Asn 


Glu 






155 




Pro 


Glu 


Met 


Ser 




170 






Asp 


Ala 


Asp Asp 


185 








He 


Leu 


Gin 


Gly 


He 


Lys 


Thr 


Ala 








220 


Gin 


Val 


Val 


He 






235 




Ser 


Gly Thr 


Thr 




250 






Pro. 


Pro Arg 


Phe 


265 








Ser 


Pro 


Pro 


Gly 


Val 


Gly Glu 


Asn 








300 


Leu 


Asp Met 


Phe 






315 




He 


Thr 


Val 


Lys 




330 






Leu 


Lys 


Val 


Glu 










Leu 


Gly 


Pro 


Phe 


Val 


Asp 


Glu 


Pro 








380 


Arg 


Glu 


Asp 


Ala 






395 




TV MM 

Asp 


Pro Asp Ala 




410 






Thr 


Asp Met 


Asp 


4^0 








Phe 


Thr 


Ser 


Lys 


Thr 


Val 


He 


Ala 








460 


Pro 


Leu 


Tyr 


He 






475 




Ala 


Glu 


Phe 


Tyr 




4 90 







30 

Lys Asn Glu Leu 
45 

Phe Leu Leu Glu 

Leu His Ser Asp 
80 

Ser Gly Asp Gly 
95 

Asp He Gin Ala 

110 

He Leu Arg Ala 
125 

Pro Glu Ser Glu 

Pro He Phe Thr 
160 

Asp Val Gly Thr 
175 

Pro Thr Tyr Gly 

190 

Gin Pro Tyr Phe 
205 

Leu Leu Asn Met 

Gin Ala Lys Asp 

240 

Thr Val Asn He 
255 

Pro Gin Ser Thr 

270 

Thr Pro He Gly 
285 

Ala Glu He Glu 

Asp Val He Thr 

320 

Lys Leu Leu Asp 
335 

Ala Ser Asn Pro 

350 

Lys Asp Ser Ala 
365 

Pro Val Phe Ser 

Gin He Asn Thr 
400 

Ala Arg Asn Pro 
415 

Arg He Phe Asn 
430 

Leu Leu Asp Arg 
445 

Thr Glu He Asn 

Lys Val Leu Asp 
480 

Glu Thr Phe Val 
495 
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Cys 




Lys 


riXd 


Lys Ala Asp Gin Leu 


lie 


m n 


X IIX 


Leu 


His 


Ala 


Val 










505 










510 






Asp 


Lys 


Asp 


Asp 


Pro Tyr Ser Gly His Gin 


Irllc 


Cci -v- 


Phe 


Ser 


Leu 


Ala 






3X3 














525 










f^l n 


AX d 


Ala 
AXd 


oer oxy oer nsn 


trne 


1 nx 


lie 


ox 1 1 


Asp Asn 


Lys Asp 




























Asn 


i nr 


Hxa 


oxy 


ixe lieu 1 nx /ixy 


Lys 


Asn 


oxy 


iyr 


7\ o r> 


Arg 


His 


Glu 


dhd 








3 3 U 






ODD 










560 


Met 




1 fix 


Tyr 


ij6U XieU irXO VaX 


Vox 


T 1 o 
XX6 


OCX 


Asp 


Asn 


Asp 


Tyr 


Pro 














570 










575 




Vai. 


o±n 


C A V 

oex 




i nr vjxy irix vox 


Thr- 

1 IIL 


Vai 


AX y 


vox 


Cys 


Ala 


Cys Asp 








c; ft 0 
0 0 u 




585 










590 












A c n 


Met Gin Ser Cys 


His 


Ala 


Giu 


Al Pk 
A J. d 


Leu 


He 


His 


Pro 










600 










605 










uxy 


L61J 




Thr Gly Ala Leu 


Val 


Ala 


lie 


Leu 


Leu 


Cys 


lie 


Val 




Dl U 






615 








620 












L6U 


L6li 


V d X 


Thr Val Val Leu 


Phe 


Ala 


Ala 


Leu 


Arg 


Arg 


Gin 


Arg 










630 






635 










640 


Lys 


Lys 


r:i n 


It i O 


Leu lie lie Ser 


Lys 


Giu Asp 


He 


Arg Asp Asn 


He 










645 




650 










655 








lyr 


Ao 1 1 


Asp Giu Gly Gly Gly Glu Giu Asp 


Thr 


Gin 


Ala 


Phe 








o ou 




665 










670 






Asp 


T 1 <a 


oxy 


I nx 


Leu Arg Asn Pro 


Glu 


Ala 


lie 


Giu 


Asp 


Asn 


Lys 


Leu 






D / D 




680 










685 








Arg 


Ar'(^ 


Asp 


T 1 o 

X xe 


Val Pro Glu Ala 


Leu 


Phe 


Leu 


Pro 


Arg 


Arg 


Thr 


Pro 




oyu 






695 








700 










Thy 


r\±a 


AlTQ 


Asp 


Asn Thr Asp Val 


Arg 


Asp 


Phe 


He 


Asn 


Gin 


Arg 


Leu 


/ uo 








710 






715 










720 


Lys 




Asn 


Asp 


Thr Asp Pro Thr 


Ala 


Pro 


Pro 


Tyr 


Asp 


Ser 


Leu 


Ala 










725 




730 










735 




Th-r 


T V r 

lyr 


AT ^ 

nxa 


1 yx 


Glu Gly Thr Gly 


Ser 


Val 


Ala 


Asp 


Ser 


Leu 


Ser 


Ser 








740 




745 










750 






Leu 


Glu 


Ser 


Val 


Thr Thr Asp Ala Asp Gin Asp Tyr 


Asp 


Tyr 


Leu 


Ser 






755 




7 60 










765 








Asp 


Trp 


Gly 


Pro 


Arg Phe Lys Lys 


Leu 


Ala 


Asp 


Met 


Tyr 


Gly Gly Val 




770 






775 








780 










Asp 


Ser 


Asp 


Lys 


Asp Ser 



















785 790 



<210> 83 
<211> 1070 
<212> DMA 

<213> Homo sapiens 
<400> 83 

gtcgacccac gcgtccgctt tgggtgaccg gaaaactcca cctcaagttt tcttttgtgg 60 

ggctgccccc caagtgtcgt ttgttttact gtagggtctc ccgcccggcg cccccagtgt 120 

tttctgaggg cggaaatggc caattcgggc ctgcagttgc tgggcttctc catggccctg 180 

ctgggctggg tgggtctggt ggcctgcacc gccatcccgc agtggcagat gagctcctat 240 

gcgggtgaca acatcatcac ggcccaggcc atgtacaagg ggctgtggat ggactgcgtc 300 

acgcagagca cggggatgat gagctgcaaa atgtacgact cggtgctcgc cctgtccgcg 360 

gccttgcagg ccactcgagc cctaatggtg gtctccctgg tgctgggctt cctggccatg 420 

tttgtggcca cgatgggcat gaagtgcacg cgctgtgggg gagacgacaa agtgaagaag 480 

gcccgtatag ccatgggtgg aggcataatt ttcatcgtgg caggtcttgc cgccttggta 540 

gcttgctcct ggtatggcca tcagattgtc acagactttt ataacccttt gatccctacc 600 

aacattaagt atgagtttgg ccctgccatc tttattggct gggcagggtc tgccctagtc 660 

atcctgggag gtgcactgct ctcctgttcc tgtcctggga atgagagcaa ggctgggtac 720 

cgtgcacccc gctcttaccc taagtccaac tcttccaagg agtatgtgtg acctgggatc 780 
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tccttgcccc agcctgacag gctatgggag tgtctagatg cctgaaaggg cctggggctg 840 
agctcagcct gtgggcaggg tgccggacaa aggcctcctg gtcactctgt ccctgcactc 900 
catgtatagt cctcttgggt tgggggtggg ggggtgccgt tggtgggaga gacaaaaaga 960 
gggagagtgt gctttttgta cagtaataaa aaataagtat tgggaagcag gcaaaaaaaa 1020 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa gggcggccgc 1070 

<210> 84 

<211> 211 

<212> PRT 

<213> Homo sapiens 



<400> 84 



rieu nidi 


Asn 


oer 


Gly Leu 


Cain 


Leu 


Leu 


Caiy 


rne 


Ser 


Met 


Ala 


Leu 


Leu 


X 






5 










10 










15 




tjiy irp 


vai 


biy 


Leu 


Val 


Ala 


Cys 


inr 


A±a 


T 1 « 

He 


Pro 


Gin 


Trp 


Gin 


Met 
















25 










30 








Tyr 




Gly Asp 


Asn 


1 ie 


Tin 

He 


Thr 


Ala 


Gin 


Ala 


Met 


Tyr 


Lys 




o c 










4 0 










45 










irp 




Asp 


Cys 




1 nr 


\^j.n 


Ser 


X nr 


taiy 


Me L 


Met 


Ser 


Cys 


50 










55 










60 










Lys Met 


Tyr 


Asp 


Ser 


Val 


Leu 


Ala 


Leu 


Ser 


Ala 


Ala 


Leu 


Gin 


Ala 


Thr 


65 








70 










75 










80 


Arg Ala 


Leu 


Met 


Val 


Val 


Ser 


Leu 


Val 


Leu 


Gly 


Phe 


Leu 


Ala 


Met 


Phe 








85 










90 










95 




Val Ala 


Thr 


Met 


Gly Met 


Lys 


Cys 


Thr 


Arg 


Cys 


Gly 


Gly 


Asp 


Asp 


Lys 






100 










105 










110 






Val Lys 


Lys 


Ala 


Arg 


He 


Ala 


Met 


Gly 


Gly 


Gly 


He 


He 


Phe 


He 


Val 




115 










120 










125 








Ala Gly 


Leu 


Ala 


Ala 


Leu 


Val 


Ala 


Cys 


Ser 


Trp 


Tyr 


Gly 


His 


Gin 


He 


130 










135 










140 










Val Thr 


Asp 


Phe 


Tyr 


Asn 


Pro 


Leu 


He 


Pro 


Thr 


Asn 


He 


Lys 


Tyr 


Glu 


145 








150 










155 










160 


Phe Gly 


Pro 


Ala 


lie 


Phe 


He 


Gly 


Trp 


Ala 


Gly 


Ser 


Ala 


Leu 


Val 


He 








165 










170 










175 




Leu Gly 


Gly 


Ala 


Leu 


Leu 


Ser 


Cys. 


Ser 


Cys 


Pro 


Gly 


Asn 


Glu 


Ser 


Lys 






180 










185 










190 






Ala Gly 


Tyr 


Arg 


Ala 


Pro 


Arg 


Ser 


Tyr 


Pro 


Lys 


Ser 


Asn 


Ser 


Ser 


Lys 



195 200 205 



Glu Tyr Val 
210 



<210> 85 

<211> 1733 

<212> DNA 

<213> Homo sapiens 

<400> 85 

gtcgacccac gcgtccgctg ggtcctgcct tcgacaccac cccaaggctt cctaccttgc 60 
gtgcctggag tctgccccag gggcccttgt cctgggccat ggcccagaag ggggtcctgg 120 
ggcctgggca gctgggggct gtggccattc tgctctatct tggattactc cggtcaggga 180 
caggagcgga aggggcagaa gctccctgcg gtgtggcccc ccaagcacgc atcacaggtg 240 
gcagcagtgc agtcgccggt cagtggccct ggcaggtcag catcacctat gaaggcgtcc 300 
atgtgtgtgg tggctctctc gtgtctgagc agtgggtgct gtcagctgct cactgcttcc 360 
ccagcgagca ccacaaggaa gcctatgagg tcaagctggg ggcccaccag ctagactcct 420 
actccgagga cgccaaggtc agcaccctga aggacatcat cccccacccc agctacctcc 480 
aggagggctc ccagggcgac attgcactcc tccaactcag cagacccatc accttctccc 540 
gctacatccg gcccatctgc ctccctgcag ccaaicgcctc cttccccaac ggcctccact 600 
gcactgtcac tggctggggt catgtggccc cctcagtgag cctcctgacg cccaagccac 660 
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100 

tgcagcaact cgaggtgcct ctgatcagtc gtgagacgtg taactgcctg tacaacatcg 720 
acgccaagcc tgaggagccg cactttgtcc aagaggacat ggtgtgtgct ggctatgtgg 780 
aggggggcaa ggacgcctgc cagggtgact ctgggggccc actctcctgc cctgtggagg 840 
gtctctggta cctgacgggc attgtgagct ggggagatgc ctgtggggcc cgcaacaggc 900 
ctggtgtgta cactctggcc tccagctatg cctcctggat ccaaagcaag gtgacagaac 960 
tccagcctcg tgtggtgccc caaacccagg agtcccagcc cgacagcaac ctctgtggca 1020 
gccacctggc cttcagctct gccccagccc agggcttgct gaggcccatc cttttcctgc 1080 
ctctgggcct ggctctgggc ctcctctccc catggctcag cgagcactga gctggcccta 1140 
cttccaggat ggatgcatca cactcaagga caggagcctg gtccttccct gatggccttt 1200 
ggacccaggg cctgacttga gccactcctt ccttcaggac tctgcgggag gctggggccc 1260 
catcttgatc tttgagccca ttcttctggg tgtgcttttt gggaccatca ctgagagtca 1320 
ggagttttac tgcctgtagc aatggccaga gcctctggcc cctcacccac catggaccag 1380 
cccattggcc gagctcctgg ggagctcctg ggacccttgg ctatgaaaat gagccctggc 1440 
tcccacctgt ttctggaaga ctgctcccgg cccgcctgcc cagactgatg agcacatctc 1500 
tctgccctct ccctgtgttc tgggctgggg ccacctttgt gcagcttcga ggacaggaaa 1560 
ggccccaatc ttgcccactg gccgctgagc gcccccgagc cctgactcct ggactccgga 1620 
ggactgagcc cccaccggaa ctgggctggc gcttggatct ggggtgggag taacagggca 1680 
gaaatgatta aaatgtttga gcacaaaaaa aaaaaaaaaa aaagggcggc cgc 1733 

<210> 86 
<211> 343 
<212> PRT 

<213> Homo sapiens 
<400> 86 



Met 


Ala 


Gin 


Lys 


Gly 


Val 


Leu 


Gly 


Pro 


Gly 


Gin 


Leu 


Gly 


Ala 


Val 


Ala 


1 








5 










10 










15 




lie 


Leu 


Leu 


Tyr 
20 


Leu 


Gly 


Leu 


Leu 


Arg 
25 


Ser 


Gly 


Thr 


Gly 


Ala 

30 


Glu 


Gly 


Ala 


Glu 


Ala 


Pro 


Cys 


Gly 


Val 


Ala 


Pro 


Gin 


Ala 


Arg 


He 


Thr 


Gly 


Gly 






35 










40 










45 






Ser 


Ser 


Ala 


Val 


Ala 


Gly 


Gin 


Trp 


Pro 


Trp 


Gin 


Val 


Ser 


He 


Thr 


Tyr 




50 










55 










60 








Glu 


Gly 


Val 


His 


Val 


Cys 


Gly 


Gly 


Ser 


Leu 


Val 


Ser 


Glu 


Gin 


Trp 


Val 


65 










70 










75 










80 


Leu 


Ser 


Ala 


Ala 


His 


Cys 


Phe 


Pro 


Ser 


Glu 


His 


His 


Lys 


Glu 


Ala 


Tyr 










85 










90 










95 


Glu 


Val 


Lys 


Leu 
100 


Gly 


Ala 


His 


Gin 


Leu 
105 


Asp 


Ser 


Tyr 


Ser 


Glu 

110 


Asp 


Ala 


Lys 


Val 


Ser 

115 


Thr 


Leu 


Lys 


Asp 


He 

120 


He 


Pro 


His 


Pro 


Ser 
125 


Tyr 


Leu 


Gin 


Glu 


Gly 
130 


Ser 


Gin 


Gly 


Asp 


He 
135 


Ala 


Leu 


Leu 


Gin 


Leu 
140 


Ser 


Arg 


Pro 


He 


Thr 


Phe 


Ser 


Arg 


Tyr 


He 


Arg 


Pro 


He 


Cys 


Leu 


Pro 


Ala 


Ala 


Asn 


Ala 


145 










150 










155 










160 


Ser 


Phe 


Pro 


Asn 


Gly 
165 


Leu 


His 


Cys 


Thr 


Val 
170 


Thr 


Gly 


Trp 


Gly 


His 
175 


Val 


Ala 


Pro 


Ser 


Val 
180 


Ser 


Leu 


Leu 


Thr 


Pro 

185 


Lys 


Pro 


Leu 


Gin 


Gin 

196 


Leu 


Glu 


Val 


Pro 


Leu 
195 


He 


Ser 


Arg 


Glu 


Thr 
200 


Cys 


Asn 


Cys 


Leu 


Tyr 
205 


Asn 


He 


Asp 


Ala 


Lys 


Pro 


Glu 


Glu 


Pro 


His 


Phe 


Val 


Gin 


Glu 


Asp 


Met 


Val 


Cys 


Ala 




210 










215 










220 








Gly 


Tyr 


Val 


Glu 


Gly 


Gly 


Lys 


Asp 


Ala 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly Gly 


225 










230 










235 










240 


Pro 


Leu 


Ser 


Cys 


Pro 
245 


Val 


Glu 


Gly 


Leu 


Trp 
250 


Tyr 


Leu 


Thr 


Gly 


He 
255 


Val 


Ser 


Trp 


Gly 


Asp 
260 


Ala 


Cys 


Gly 


Ala 


Arg 
265 


Asn 


Arg 


Pro 


Gly 


Val 
270 


Tyr 


Thr 
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Leu Ala Ser 


Ser 


Tyr 


Ala 


Ser 


Trp 


He 


Gin 


Ser 


Lys 


Val Thr Glu Leu 


£.10 










o o ^ 










285 


Gin Pro Arg 


Val 


Val 


Pro 


Gin 


Thr 


Gin 


Glu 


Ser 


Gin 


Pro Asp Ser Asn 


290 








o A e 

295 










300 


Leu Cys Gly 


Ser 


His 


Leu 


Ala 


Phe 


Ser 


Ser 


Ala 


Pro 


Ala Gin Gly Leu 


305 






310 










315 




320 


Leu Arg Pro 


He 


Leu 
325 


Phe 


Leu 


Pro 


Leu 


Gly 
330 


Leu 


Ala 


Leu Gly Leu Leu 
335 


Ser Pro Trp 


Leu 
340 


Ser 


Glu 


His 















<210> 87 

<211> 4188 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 3457, 3481, 3707, 3716, 3723, 3733, 3736, 3746, 3751, 3828, 
3853, 3857, 3863, 3883, 3890, 4126. 
<223> n = A,T,C or G 

<400> 87 

ggctccttac ccacccggag actttttttt gaaaggaaac tagggaggga gggagaggga 60 
gagagggaga aaacgaaggg gagctcgtcc atccattgaa gcacagttca ctatgatctt 120 
actcacattc agcactggaa gacggttgga tttcgtgcat cattcggggg tgtttttctt 180 
gcaaaccttg ctttggattt tatgtgctac agtctgcgga acggagcagt atttcaatgt 240 
ggaggtttgg ttacaaaagt acggctacct tccaccgact gaccccagaa tgtcagtgct 300 
gcgctctgca gagaccatgc agtctgccct agctgccatg cagcagttct atggcattaa 360 
catgacagga aaagtggaca gaaacacaat tgactggatg aagaagcccc gatgcggtgt 420 
acctgaccag acaagaggta gctccaaatt tcatattcgt cgaaagcgat atgcattgac 480 
aggacagaaa tggcagcaca agcacatcac ttacagtata aagaacgtaa ctccaaaagt 540 
aggagaccct gagactcgta aagctattcg ccgtgccttt gatgtgtggc agaatgtaac 600 
tcctctgaca tttgaagaag ttccctacag tgaattagaa aatggcaaac gtgatgtgga 660 
tataaccatt atttttgcat ctggtttcca tggggacagc tctccctttg atggagaggg 720 
aggatttttg gcacatgcct acttccctgg accaggaatt ggaggagata cccattttga 780 
ctcagatgag ccatggacac taggaaatcc taatcatgat ggaaatgact tatttcttgt 840 
agcagtccat gaactgggac atgctctggg attggagcat tccaatgacc ccactgccat 900 
catggctcca ttttaccagt acatggaaac agacaacttc aaactaccta atgatgattt 960 
acagggcatc cagaagatat atggtccacc tgacaagatt cctccaccta caagacctct 1020 
accgacagtg cccccacacc gctctattcc tccggctgac ccaaggaaaa atgacaggcc 1080 
aaaacctcct cggcctccaa ccggcagacc ctcctatccc ggagccaaac ccaacatctg 1140 
tgatgggaac tttaacactc tagctattct tcgtcgtgag atgtttgttt tcaaggacca 1200 
gtggttttgg cgagtgagaa acaacagggt gatggatgga tacccaatgc aaattactta 1260 
cttctggcgg ggcttgcctc ctagtatcga tgcagtttat gaaaatagcg acgggaattt 1320 
tgtgttcttt aaaggtaaca aatattgggt gttcaaggat acaactcttc aacctggtta 1380 
ccctcatgac ttgataaccc ttggaagtgg aattccccct catggtattg attcagccat 1440 
ttggtgggag gacgtcggga aaacctattt cttcaaggga gacagatatt ggagatatag 1500 
tgaagaaatg aaaacaatgg accctggcta tcccaagcca atcacagtct ggaaagggat 1560 
ccctgaatct cctcagggag catttgtaca caaagaaaat ggctttacgt atttctacaa 1620 
aggaaaggag tattggaaat tcaacaacca gatactcaag gtagaacctg gatatccaag 1680 
atccatcctc aaggatttta tgggctgtga tggaccaaca gacagagtta aagaaggaca 1740 
cagcccacca gatgatgtag acattgtcat caaactggac aacacagcca gcactgtgaa 1800 
agccatagct attgtcattc cctgcatctt ggccttatgc ctccttgtat tggtttacac 1860 
tgtgttccag ttcaagagga aaggaacacc ccgccacata ctgtactgta aacgctctat 1920 
gcaagagtgg gtgtgatgta gggttttttc ttctttcttt cttttgcagg agtttgtggt 1980 
aacttgagat tcaagacaag agctgttatg ctgtttccta gctaggagca ggcttgtggc 2040 
agcctgattc ggggctgacc tttcaaacca gagggttgct ggtcctgcac atgagtggaa 2100 
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atacactcat 
ttctttgtca 

gattaactaa 
attttccttc 
cccacggttt 
aaaaaagagt 
acagctttgt 
ttttgtcagg 
tttccaacaa 
atttttcgtt 
catagctatt 
tccaccttac 
ggccttccat 
tctgaagtgg 
gtgttaccct 
atctgataaa 
tttttaaaac 
aatgtgttta 
aggtatttta 
taagtcttct 
aaattcaatt 
gtgactctga 
tctcaaaggg 
ngagaccacc 
ttcatgtgac 
ctgacagacc 
actgggctgt 
ttnggctttg 
aaccagcctt 
cttaaacatt 
gccctttgac 
cattggatta 
sgtggtgatg 
ccctataggt 
cattttagta 



ggggaagctt 
ttcagttcta 

agaagaggag 
tgaagtctga 
ttcttttttt 
ttaaacaaaa 
ctccaaagaa 
tgggagattt 
ccaaggggcc 
ttatacagct 
ttcaaacacg 
tcctccatca 
gggtctgtca 
attacagtgg 
ttgagaacta 
ttatagaact 
agatatccta 
taacaaacag 
ttaagattct 
tttctttttt 
tgccttggta 
agttgcatga 
ttagtgtggc 
aaagcatttc 
ctgcagagca 
tctttattga 
gaacttatta 
tgntgnaaag 
gcccttgaaa 
gtncttnctg 
ctttaacttg 
gttgttctct 
gcagatctgc 
agaagtgaat 
caaattacaa 



ccatgatgca 
ggcctttcct 
aaaaagaaga 
gcccatttct 
ctttttttct 
aacccacaat 
gatagctctc 
ggaatacaca 
aaaatatctg 
gttctcagct 
accttttaat 
ttgtgataat 
ggaatattcg 
cagttatttt 
tcagaccagc 
gggcaatggt 
taatgtcata 
aaatgatgtt 
cataaatctg 
ttaggcatac 
tatgcagtta 
atatcctcca 
ttctggcatt 
attgttgagt 
ggtttgtatc 
tttttaggag 
gagatggtca 
cccagncttt 
cttgaattaa 
tgnaaaattt 
caaactggca 
aaaacctagt 
agatggacac 
tttaatttca 
aaactatagg 



cagtatctgc 

ctgcacgctc 

aaagattctt 

ggggggagaa 

ttttttcttt 

tgaacttcca 

tgactgcttt 

tgcaggacgt 

caatatagta 

atgtcctcag 

tgtttttgaa 

cttcccaagt 

ttacaaagca 

acaaggattt 

tytcagagtc 

aaaaacagtc 

taatttttaa 

acttgccaaa 

aaattttatt 

tgaatttctg 

gcattgccat 

gtgcattacc. 

tagccgncca 

gtaatttgtc 

aatatttttt 

ctgcttcttg 

gaatgaatgc 

ngaggggatt 

ttgatcctat 

tccttgaaga 

caaactgaag 

aagcatgagc 

tttgctcttt 

ttttaatata 

aaaaaaaaaa 



tgttcttcag 
aatgcccagt 
tcttaaaagt 
aaaaaaagca 
tttggcttta 
ggaaagtgtg 
ggatagtctc 
tagactgttg 
acagccttaa 
tgtttcatcg 
gtatttctaa 
tgtattaggc 
gagcaagaag 
gtgacactag 
ttaggattgt 
acaagttcaa 
atgatttaca 
atttttctgg 
tgaaaaaact 
ttttaaaatc 
tttaaaaatg 
tattgcatgt 
tttgatcact 
ctaacagcag 
tcctagagaa 
cagtgaaagg 
accccantga 
agccttttgg 
tgactgtnca 
gtncctgttn 
gaaatctggt 
tgtttcctta 
acatgcacac 
atttcnaagt 
aaaaaaaa 



tcctttgtct 
aaaatttcag 

ttctaatgtt 
aatcagaaaa 
aaacaaaggg 

aagacccaaa 
ctacgcacca 
ggacagccat 
taatacatcc 
catttatatt 
accccttctt 
cattgcccca 
gcagtatgtc 
ttacataccc 
cgstcttgcg 
gaagttcagg 
agactacata 
caaataaaaa 
gataatagcc 
cattgcatga 
aattaaaacg 
ccaccatagt 
gacagagcca 
tattgtcatt 
aagtcagcaa 
ctttacagcc 
gtcagnamca 
aaaacaaatg 
ttaacaacaa 
ctatgtcttt 
gttgcttctc 
gagtggagag 
tctgaaaatg 
ctaaattcat 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4188 



<210> 88 
<211> 607 
<212> PRT 

<213> Homo sapiens 



<400> 88 



Met 


He 


Leu 


Leu 


Thr 


Phe 


Ser 


Thr 


1 








5 








His 


Ser 


Gly 


Val 

20 


Phe 


Phe 


Leu 


Gin 


Thr 


Val 


Cys 
35 


Gly 


Thr 


Glu 


Gin 


Tyr 
40 


Lys 


Tyr 
50 


Gly 


Tyr 


Leu 


Pro 


Pro 
55 


Thr 


Ser 


Ala 


Glu 


Thr 


Met 


Gin 


Ser 


Ala 


65 










70 






Gly 


He 


Asn 


Met 


Thr 
85 


Gly 


Lys 


Val 


Lys 


Lys 


Pro 


Arg 
100 


Cys 


Gly 


Val 


Pro 


Phe 


His 


He 
115 


Arg 


Arg 


Lys 


Arg 


Tyr 
120 


His 


Lys 


His 


He 


Thr 


Tyr 


Ser 


He 



Gly 


Arg 
10 


Arg 


Leu 


Asp 


Phe 


Val 
15 


His 


Thr 


Leu 


Leu 


Trp 


He 


Leu 


Cys 


Ala 


25 










30 






Phe 


Asn 


Val 


Glu 


Val 
45 


Trp 


Leu 


Gin 


Asp 


Pro 


Arg 


Met 
60 


Ser 


Val 


Leu 


Arg 


Leu 


Ala 


Ala 
75 


Met 


Gin 


Gin 


Phe 


Tyr 
80 


Asp 


Arg 
90 


Asn 


Thr 


He 


Asp 


Trp 
95 


Met 


Asp 


Gin 


Thr 


Arg 


Gly 


Ser 


Ser 


Lys 


105 










110 






Ala 


Leu 


Thr 


Gly 


Gin 
125 


Lys 


Trp 


Gin 


Lys 


Asn 


Val 


Thr 


Pro 


Lys 


Val 


Gly 
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130 135 140 



Asp Pro 


oXU 


inr 


Arg 


J-»ys 


Axa 


xxe 


Arg 


Arg 


Ala 


fne 


Asp 


Val 


Trp 


Gin 


X *i 3 








150 










XDO 










X DU 


TV e. 1-1 \73i 1 


i nr 


rro 


Leu 


Thr 


Phe 


LiXU 


\jXU 


vax 


Pro 


Tyr 


Ser 


Glu 


Leu 


Glu 








X 03 










1 7n 

X / U 










X /o 




e n mi \7 


Liys 


Arg 


Asp 


Val 


Asp 


xxe 


1 nr 


T 1 o 

xxe 


T 1 o 

xxe 


fne 


AX a 


Ser 


Gly 


Pne 






xo u 










X o ^ 










lyu 






Zi en 
nSp 


Ser 


Ser 


Pro 


Phe 


Asp 


oxy 


L»XU 


uxy 


oxy 


fne 


Leu 


Ala 


His 
























on R 
^UD 








A J. a 1 yx 


Pho 


Pro 


bxy 


Pro Gly 


xxe 


oxy 


oxy 


Asp 


1 nr 


LI ^ n 

nlS 


Pne 


Asp 


Ser 






















u 












trLO 


Trp 


i nr 


Leu 


biy 


Asn 


tr ro 


Asn 


nl S 


Asp 


bxy 


Asn 


Asp 


Leu 


^ 4. -J 








230 




















z4 U 


D )^ A T 1 


VaL± 


/iXa 


Val 


His 


bXU 


Leu 


oxy 


nXS 


Ala 


Leu 


bXy 


Leu 


Glu 


His 








245 










^DU 










o c c 

^DD 




T* A o 


Asp 


Pro 


Thr 


Ala 


X xe 


JxieL 


Axa 


Pro 


fne 


Tyr 


bxn 


Tyr 


Met 


Glu 






^ ou 




















o T 
z /U 






xni risp 


Asn 


roe 


Lys 


Leu 


Pro 


Asn 


Asp 


Asp 


Leu 


PT »-% 


biy 


-r T 

He 


Gin 


Lys 














/.ok) 










o o c 
Zoo 








1 ± e 1 y r 


uxy 


Pro 


Pro Asp 


Lys 


xxe 


Pro 


Pro 


Pro 


Tnr 


Arg 


Pro 


Leu 


Pro 


^ 27 U 










295 




















i nr V ax 


Pro 


Pro 


His 


Arg 


Ser 


xxe 


Pro 


Pro 


Aia 


Asp 


Pro 


Arg 


Lys 


Asn 










310 










Ol D 










•5 rt 
320 


Asp Arg 


Pro 


Lys 


Pro 


Pro 


Arg 


Pro 


Pro 


Thr 


Gly 


Arg 


Pro 


Ser 


Tyr 


Pro 








325 




















"3 '3 C 

335 




o±y t\±ci 


Lys 


Pro 


Asn 


He 


Cys 


Asp 


(jXy 


TV _ 

Asn 


fne 


Asn 


Thr 


Leu 


Ala 


He 






■^4 0 










"^4 

J 4 o 










"3 e; n 
JDU 






Lsu Ax'Cf 


Arg 


uXU 


Met 


Phe 


Val 


fne 


Lys 


ASp 


Gin 


Trp 


Phe 


Trp 


Arg 


Val 














^oU 










3dd 








Ar<3 Asn 


Asn 


Arg 


Val 


Met 


Asp 


bxy 


Tyr 


Pro 


Met 


Gin 


He 


Thr 


Tyr 


Phe 












375 










oo U 










^ V 1^ 7\ ^ 

i rp Arg 


\jxy 


Leu 


Pro 


Pro 


Ser 


xie 


ASp 


Ala 


Val 


Tyr 


Glu 


Asn 


Ser 


Asp 


J o ^ 








390 










Q t; 

D^D 










400 


oxy Asn 


rne 


vax 


Phe 


Phe 


Lys 


CaXy 


n r-i 

Asn 


Lys 


Tyr 


Trp 


Val 


Phe 


Lys 


Asp 








405 










4 1 
4 lU 










4lD 




A fii 1 fli 


Leu 


PI n 

oxn 


Pro Gly Tyr 


Pro 


his 


Asp 


Leu 


He 


Thr 


Leu 


Gly 


Ser 






4 9 0 
<c U 










4 o c; 










430 




«±y ixe 


Pro 


fro 


His 


Gly 


He 


Asp 


oer 


AX a 


T 1 « 

He 


Trp 


Trp 


Glu 


Asp 


Val 
























44d 








\jj.y LtyS 


inr 


Tyr 


Phe 


Phe 


Lys 


oXy 


Asp 


Arg 


Tyr 


Trp 


Arg 


Tyr 


Ser 


Glu 


*J J V 










455 










A c.r\ 
^ bU 












Lys 


1 nr 


Met 


Asp 


Pro 


bXy 


Tyr 


Pro 


Lys 


Pro 


He 


Thr 


Val 


Trp 


T VJ J 








470 










H 1 Z> 










A Q f\ 

4oO 


jjys oxy 


X X6 


rx O 


Glu 


Ser 


Pro 




Csxy 


Aia 


fne 


vax 


His 


Lys 


Glu 


Asn 








485 










4 on 
4 y u 














CT V Php 
_y die 


X IIX 


1 yx 


Phe 


Tyr 


Lys 


uxy 


Lys 


oXU 


Tyr 


Trp 


Lys 


fne 


Asn 


Asn 
















DUD 










DXU 








XjCU 


T.\/c 
Ltyo 


Val 


Glu 


Pro 


KsLy 


Tyr 


Pro 


Arg 


Ser 


xxe 


Leu 


Lys 


Asp 




515 










^9(\ 










DZ D 










tjxy 


y 5 


Asp Gly 


Pro 


1 nr 


Asp 


Arg 


vax 


Lys 


vjIU 


Gly 


nlS 


Ser 


>J u 










535 










^4 0 
D^ U 










D -r~ ^ D V 


Asp 


Asp 


Val 


Asp 


He 


vax 


xxe 


Lys 


Leu 


Asp 


Asn 


Thr 


Ala 


Ser 


545 








550 










555 










560 


Thr Val 


Lys 


Ala 


He 


Ala 


He 


Val 


He 


Pro 


Cys 


He 


Leu 


Ala 


Leu 


Cys 








565 










570 










575 


Leu Leu 


Val 


Leu 


Val 


Tyr Thr 


Val 


Phe 


Gin 


Phe 


Lys 


Arg 


Lys 


Gly 


Thr 






580 










585 










590 






Pro Arg 


His 


He 


Leu 


Tyr Cys 


Lys 


Arg 


Ser 


Met 


Gin 


Glu 


Trp 


Val 






595 










600 










605 
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<210> 89 . 

<211> 3438 

<212> DNA 

<213> Homo sapiens 



<400> 89 

atccccgggc ccgagctcga attccaggtg ccccagtagc ccgaccgccg agatgcccag 60 

cccgccgggg ctccgggcgc tatggctttg cgccgcgctg tgcgcttccc ggagggccgg 120 

cggcgccccc cagcccggcc cggggcccac cgcctgcccg gccccctgcc actgccagga 180 

ggacggcatc atgctgtctg ccgactgctc tgagctcggg ctgtccgccg ttccggggga 240 

cctggacccc ctgacggctt acctggacct cagcatgaac aacctcacag agcttcagcc 300 

tggcctcttc caccacctgc gcttcttgga ggagctgcgt ctctctggga accatctctc 360 

acacatccca ggacaagcat tctctggtct ctacagcctg aaaatcctga tgctgcagaa 420 

caatcagctg ggaggaatcc ccgcagaggc gctgtgggag ctgccgagcc tgcagtcgct 480 

gcgcctagat gccaacctca tctccctggt cccggagagg agctttgagg ggctgtcctc 540 

cctccgccac ctctggctgg acgacaatgc actcacggag atccctgtca gggccctcaa 600 

caacctccct gccctgcagg ccatgaccct ggccctcaac cgcatcagcc acatccccga 660 

ctacgcgttc cagaatctca ccagccttgt ggtgctgcat ttgcataaca accgcatcca 720 

gcatctgggg acccacagct tcgaggggct gcacaatctg gagacactag acctgaatta 780 

taacaagctg caggagttcc ctgtggccat ccggaccctg ggcagactgc aggaactggg 840 

gttccataac aacaacatca aggccatccc agaaaaggcc ttcatgggga accctctgct 900 

acagacgata cacttttatg ataacccaat ccagtttgtg ggaagatcgg cattccagta 960 

cctgcctaaa ctccacacac tatctctgaa tggtgccatg gacatccagg agtttccaga 1020 

tctcaaaggc accaccagcc tggagatcct gaccctgacc cgcgcaggca tccggctgct 1080 

cccatcgggg atgtgccaac agctgcccag gctccgagtc ctggaactgt ctcacaatca 1140 

aattgaggag ctgcccagcc tgcacaggtg tcagaaattg gaggaaatcg gcctccaaca 1200 

caaccgcatc tgggaaattg gagctgacac cttcagccag ctgagctccc tgcaagccct 1260 

ggatcttagc tggaacgcca tccggtccat ccaccctgag gccttctcca ccctgcactc 1320 

cctggtcaag ctggacctga cagacaacca gctgaccaca ctgcccctgg ctggacttgg 1380 

gggcttgatg catctgaagc tcaaagggaa ccttgctctc tcccaggcct tctccaagga 1440 

cagtttccca aaactgagga tcctggaggt gccttatgcc taccagtgct gtccctatgg 1500 

■gatgtgtgcc agcttcttca aggcctctgg gcagtgggag gctgaagacc ttcaccttga 1560 

tgatgaggag tcttcaaaaa ggcccctggg cctccttgcc agacaagcag agaaccacta 1620 

tgaccaggac ctggatgagc tccagctgga gatggaggac tcaaagccac accccagtgt 1680 

ccagtgtagc cctactccag gccccttcaa gccctgtgag tacctctttg aaagctgggg 1740 

catccgcctg gccgtgtggg ccatcgtgtt gctctccgtg ctctgcaatg gactggtgct 1800 

gctgaccgtg ttcgctggcg ggcctgcccc cctgcccccg gtcaagtttg tggtaggtgc 1860 

gattgcaggc gccaacacct tgactggcat ttcctgtggc cttctagcct cagtcgatgc 1920 

cctgaccttt ggtcagttct ctgagtacgg agcccgctgg gagacggggc taggctgccg 1980 

ggccactggc ttcctggcag tacttgggtc ggaggcatcg gtgctgctgc tcactctggc 2040 

cgcagtgcag tgcagcgtct ccgtctcctg tgtccgggcc tatgggaagt ccccctccct 2100 

gggcagcgtt cgagcagggg tcctaggctg cctggcactg gcagggctgg ccgccgcact 2160 

gcccctggcc tcagtgggag aatacggggc ctccccactc tgcctgccct acgcgccacc 2220 

tgagggtcag ccagcagccc tgggcttcac cgtggccctg gtgatgatga actccttctg 2280 

tttcctggtc gtggccggtg cctacatcaa actgtactgt gacctgccgc ggggcgactt 2340 

tgaggccgtg tgggactgcg ccatggtgag gcacgtggcc tggctcatct tcgcagacgg 2400 

gctcctctac tgtcccgtgg ccttcctcag cttcgcctcc atgctgggcc tcttccctgt 2460 

cacgcccgag gccgtcaagt ctgtcctgct ggtggtgctg cccctgcctg cctgcctcaa 2520 

cccactgctg tacctgctct tcaaccccca cttccgggat gaccttcggc ggcttcggcc 2580 

ccgcgcaggg gactcagggc ccctagccta tgctgcggcc ggggagctgg agaagagctc 2640 

ctgtgattct acccaggccc tggtagcctt ctctgatgtg gatctcattc tggaagcttc 2700 

tgaagctggg cggccccctg ggctggagac ctatggcttc ccctcagtga ccctcatctc 2760 

ctgtcagcag ccaggggccc ccaggctgga gggcagccat tgtgtagagc cagaggggaa 2820 

ccactttggg aacccccaac cctccatgga tggagaactg ctgctgaggg cagagggatc 2880 

tacgccagca ggtggaggct tgtcaggggg tggcggcttt cagccctctg gcttggcctt 2940 

tgcttcacac gtgctcgagc aaaagttgat ttctgaagaa gatttgaacg gtgaacaaaa 3000 

gctaatctcc gaggaagact tgaacggtga acaaaaatta atctcagaag aagacttgaa 3060 



wo 03/000012 



105 



PCT/US02/19773 



cggatcatag atctctaatt 
agggcccgga aacctggccc 
gccaaaggaa tgcaaggtct 
tgaagacaaa caacgtctgt 
aggtgcctct gcggccaaaa 
cagtgccacg ttgtgagttg 
ttcaacaagg ggctgaag 



ccggttattt tccaccatat 
tgtcttcttg acgagcattc 
gttgaatgtc gtgaaggaag 
agcgaccctt tgcaggcagc 
gccacgtgta taagatacac 
gatagttgtg gaaagagtca 



tgccgtcttt tggcaatgtg 3120 
ctaggggtct ttcccctctc 3180 
cagttcctct ggaagcttct 3240 
ggaacccccc acctggcgac 3300 
ctgcaaaggc ggcacaaccc 3360 
aatggctctc ctcaagcgta 3420 

3438 



<210> 90 
<211> 1005 
<212> PRT 

<213> Homo sapiens 



<400> 90 

Met Pro Ser Pro 

1 

Cys Ala Ser Arg 
20 

Thr Ala Cys Pro 
35 

Ser Ala Asp Cys 

50 

Asp Pro Leu Thr 
65 

Leu Gin Pro Gly 

Leu Ser Gly Asn 
100 

Leu Tyr Ser Leu 
115 

He Pro Ala Glu 

130 

Leu Asp Ala Asn. 
145 

Leu Ser Ser Leu 

He Pro Val Arg 
180 

Leu Ala Leu Asn 
195 

Leu Thr Ser Leu 

210 

Leu Gly Thr His 
225 

Leu Asn Tyr Asn 

Gly Arg Leu Gin 
260 

Pro Glu Lys Ala 
275 

Tyr Asp Asn Pro 
290 

Pro Lys Leu His 
305 

Phe Pro Asp Leu 

Arg Ala Gly He 
340 

Arg Leu Arg Val 



Pro 


uxy 


Leu 


Arg 


c 
D 








Arg 


Ala 


Gly 


Gly 


Ala 


Pro 


Cys 


JilS 










Ser 


Glu 


Leu 


Gly 










TV 1 

Ala 


Tyr 


Leu 


Asp 




/U 






Leu 


rue 


lilS 


HIS 










His 


Leu 


Ser 


His 


Lys 


lie 


Leu 


Kie u 








l^U 


Ala 


Leu 


Trp 


blU 










Leu 


lie 


oer 


Leu 




1 ou 






Arg 




Leu 


irp 


JL DO 








Ala 


Leu 


Asn 


Asn 


Arg 


T 1 /?i 

lie 


ber 


HIS 








200 


Val 


Val 


Leu 


His 






215 




Ser 


Phe 


Glu 


Gly 




230 






Lys 


Leu 


Gin 


Glu 


245 








Glu 


Leu 


Gly 


Phe 


Phe 


Met 


Gly 


Asn 








280 


He 


Gin 


Phe 


Val 






295 




Thr 


Leu 


Ser 


Leu 




310 






Lys 


Gly 


Thr 


Thr 


325 








Arg 


Leu 


Leu 


Pro 


Leu 


Glu 


Leu 


Ser 



Ala 


Leu 


Trp 


Leu 




1 u 






Ai a 


Pro 


bin 


Pro 


Z 0 








Cys 


Gin 


Glu 


Asp 


Leu 


Ser 


Ala 


Val 








oU 


Leu 


Ser 


Met 


Asn 






/ 0 




Leu 


Arg 


irne 


Leu 




Q n 
y u 






i le 


Pro 


bly 


Gin 


1 n R 
iUO 








Leu 


Gin 


Asn 


Asn 


Leu 




OCX 


Leu 








J.H\J 


V d J. 


£r X. U 


m n 

J. li 


Arg 










Leu 


Asp 


Asp 


Asn 










Le u 


Pro 


Ala 




X o o 








lie 


Pro 


Asp 


Tyr 


Leu 


His 


Asn 


Asn 








220 


Leu 


His 


Asn 


Leu 






235 




Phe 


Pro 


Val 


Ala 




250 






His 


Asn 


Asn 


Asn 


265 








Pro 


Leu 


Leu 


Gin 


Gly 


Arg 


Ser 


Ala 








300 


Asn 


Gly 


Ala 


Met 






315 




Ser 


Leu 


Glu 


He 




330 






Ser 


Gly 


Met 


Cys 


345 








His 


Asn 


Gin 


He 



Cys 


Ala 


Ala 


Leu 






15 




Gly 


Pro 


Gly 


Pro 










Gly 


He 


Met 


Leu 


45 








Pro 


Gly 


Asp 


Leu 


TV _ 

Asn 


Leu 


rp u, ^ 

inr 


blU 








oU 


blU 


blU 


Leu 


Arg 






y 0 




Ala 


Phe 


Ser 


Gly 




Tin 
110 






Gin 


Leu 


Gly 


Gly 


1^0 








Gin 


Ser 


Leu 


Arg 


oer 


trne 


blU 


biy 








1 bU 


Ala 


Leu 


rn U, „ 

1 nr 


blU 






1/0 




bin 


Ala 


ixieti 


i nr 




IzfV 






TV 1 -» 

Ala 


Pne 


Gin 


Asn 


205 








Arg 


He 


Gin 


His 


Glu 


Thr 


Leu 


Asp 








240 


He 


Arg 


Thr 


Leu 






255 




He 


Lys 


Ala 


He 




270 






Thr 


He 


His 


Phe 


285 








Phe 


Gin 


Tyr 


Leu 


Asp 


He 


Gin 


Glu 








320 


Leu 


Thr 


Leu 


Thr 






335 




Gin 


Gin 


Leu 


Pro 




350 






Glu 


Glu 


Leu 


Pro 
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360 


Ser 


Leu 


nlS 


Arg 


Cys 


Gin 


Lys 


Leu 




J / u 










375 




Arg 


T 1 *a 

lie 


Trp 


Pill 
bxU 


lie 


Gly Ala Asp 












390 








nXaL 




Asp 


Leu 


Ser 


Trp 


Asn 










405 








Aid 


rne 


Ser 


Thr 


Leu 


His 


Ser 


Leu 








420 










Gin 


Leu 


X nr 


Thr 


Leu 


Pro 


Leu 


Ala 






HOD 










440 


Lys 


Leu 


Lys 


Gly Asn 


Leu 


Ala 


Leu 




450 










455 




Phe 


Pro 


Lys 


Leu 


Arg 


He 


Leu 


Glu 


465 










470 






Pro Tyr 


bXy 


Met 


Cys 


Ala 


Ser 


Phe 










485 








Ala 


Glu 


Asp 


Leu 


His 


Leu Asp Asp 








500 










Gl-y Leu 


Leu 


Ala 


Arg 


Gin 


Ala 


Glu 






etc 










520 


Glu 


Leu 




Leu 


Glu 


Met 


Glu 


Asp 




530 










535 




Cys 


Ser 


Pro 


Thr 


Pro 


Gly 


Pro 


Phe 


545 










550 






Ser Trp 


KD±y 


lie 


Arg 


Leu 


Ala 


Val 










565 








Leu Cys 


Asn 


Gly 


Leu 


Val 


Leu 


Leu 








580 










Pro 


Leu 


Pro 


Pro 


Val 


Lys 


Phe 


Val 
















600 


Thr 


Leu 


Thr 


bly 


lie 


Ser 


Cys 


Gly 




610 










615 




Thr 


Phe 


taXy 


Gin 


Phe . 


C A 1^ 

. Ser 


Glu 


Tyr 


625 










630 






Gly Cys 


Arcf 


Ala 


Thr 


Gly 


Phe 


Leu 










645 








Val 


Leu 


Leu 


Leu 


Thr 


Leu 


Ala 


Ala 








660 










Cys 


Val 


Arg 


H.±a 


Tyr 


Gly 


Lys 


Ser 






fin 










680 


Gly Val 


Leu 


Gly 


Cys 


Leu 


Ala 


Leu 




690 










695 




Leu 


Ala 




Val 


Gly 


Glu 


Tyr Gly 


705 










710 






Ala 


Pro 


Pro 


Glu 


Gly 


Gin 


Pro 


Ala 










725 








Val 


Met 


Met. 


Asn 


Ser 


Phe 


Cys 


Phe 








740 










Lys 


Leu 


Tyr 


Cys 


Asp 


Leu 


Pro Arg 






/ DD 










7 60 


Cys 


Ala 




Val 


Arg 


His 


Val 


Ala 




770 










775 




Leu Tyr 


Cys 


Pro 


Val 


Ala 


Phe 


Leu 


785 










790 






Phe 


Pro 


Val 


Thr 


Pro 


Glu 


Ala 


Val 










805 








Pro 


Leu 


Pro 


Ala 


Cys 


Leu 


Asn 


Pro 



820 



365 



Glu 


Glu 


T 1 ^ 

He 


Gly 


Leu 


Gin 


His 


Asn 


















Thr 


Pne 


Ser 


Gin 


Leu 


Ser 


Ser 


Leu 






jy 0 










400 


TV T 

Ala 


He 


Arg 


Ser 


He 


His 


Pro 


Glu 




4 1 n 

fl lU 










415 




Val 


Lys 


Leu 


TV 

Asp 


Leu 


Thr 


Asp 


Asn 


A 0 C, 










430 






Gly 


Leu 


Gly 


Gly 


Leu 


Met 


His 


Leu 










445 








Ser 


Gin 


Ala 


Phe 


Ser 


Lys 


Asp 


Ser 








460 










Val 


Pro 


Xyr 


Ala 


Tyr 


Gin 


Cys 


Cys 






475 










480 


Pne 


Lys 


Ala 


Ser 


Gly 


Gin 


Trp 


Glu 




4 90 










4 95 




Glu 


Glu 


Ser 


Ser 


Lys 


Arg 


Pro 


Leu 


505 










510 






Asn 


His 


Tyr 


Asp Gin 


Asp 


Leu 


Asp 










525 








Ser 


Lys 


Pro 


His 


Pro 


Ser 


Val 


Gin 








540 










Lys 


Pro 


Cys 


Glu 


Tyr 


Leu 


Phe 


Glu 






555 










560 


Trp 


Ala 


He 


Val 


Leu 


Leu 


Ser 


Val 




570 










575 




Thr 


Val 


Phe 


Ala 


Gly 


Gly 


Pro 


Ala 


585 










590 






Val 


Gly 


Ala 


He 


Ala 


Gly 


Ala 


Asn 










605 








Leu 


Leu 


Ala 


Ser 


Val 


Asp 


Ala 


Leu 








620 










Gly 


Ala 


Arg 


Trp 


Glu 


Thr 


Gly 


Leu 






635 










640 


Ala 


Val 


Leu 


Gly 


Ser 


Glu 


Ala 


Ser 




650 










655 




Val 


Gin 


Cys 


Ser 


Val 


Ser 


Val 


Ser 


665 










670 






Pro 


Ser 


Leu 


Gly Ser 


Val 


Arg 


Ala 










685 








Ala 


Gly 


Leu 


Ala 


Ala 


Ala 


Leu 


Pro 








700 










Ala 


Ser 


Pro 


Leu 


Cys 


Leu 


Pro 


Tyr 






Tic; 

f ID 












Ala 


Leu 


r*i WW 

Gly 


Phe 


Thr 


Val 


TV 1 _ 

Ala 


Leu 




T O 

730 










T O C 

735 




Leu 


Val 


Val 


Ala 


Gly 


Ala 


Tyr 


He 


74 5 










750 






Gly 


Asp 


Phe 


Glu 


Ala 


Val 


Trp 


Asp 










765 








Trp 


Leu 


He 


Phe 


Ala 


Asp 


Gly 


Leu 








780 










Ser 


Phe 


Ala 


Ser 


Met 


Leu 


Gly 


Leu 






795 










800 


Lys 


Ser 


Val 


Leu 


Leu 


Val 


Val 


Leu 




810 










815 




Leu 


Leu 


Tyr 


Leu 


Leu 


Phe 


Asn 


Pro 


825 










830 
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13 i e* 

nlS 


triic 


Tim 

jTVX ^ 




Map jjeu 


Arg 


Arg 


Leu 


Arg 


rro 


Arg 


Ala 


Gly 


Asp 


Ser 














0 ft u 










O A C 

845 










T.ai 1 


A1 a 


lyr n±.a 




a 1 a 

Axa 


u±y 


Pill 


Leu 


bXU 


Lys 


Ser 


Ser 


Cys 




o 3 U 








fi c; R 










o oU 






Asp 


Cat* 


Th y 




Ala Leu 


\7a 1 

va J. 






oer 


ASp 


vax 


Asp 


Leu 


He 


Leu 


ODD 








Q "7 n 










QIC 








880 




HXa 




o J. Ui 


Ala Gly 


Arg 


Pro 


Pro 


p 1 \/ 
oxy 


Leu 


OjXU 


Thr 


Tyr 


Gly 


Phe 










885 
















895 




Pro 


Cor* 


1 

V aX 


Th-r 


Leu lie 


Ser 


Cys 


Pin 

bxn 


PI « 
uxn 


rro 


CaXy 


TV T 

Ala 


Pro 


Arg 


Leu 








Qnn 


















910 




Glu 


Gly 


Ser 


His 


Cys Val 


Glu 


Pro 


Glu 


Gly 


Asn 


His 


Phe 


Gly 


Asn 


Pro 






Q1 ^ 
7lO 








920 










925 








ra -r- r\ 


O ^ -y" 




Asp Gly Glu Leu 


Leu 


Leu 


Arg 


AXa 


Glu 


Gly 


Ser 


Thr 












935 










940 








Pro 






PI 1/ 

Vjj.y 


Gly Leu 


Ser 


Gly 


biy 


bxy 


bxy 


Pne 


Gin 


Pro 


Ser 


Gly 










950 










n c c 










960 


Leu 


Ala 


Phe 


Ala 


Ser His 

965 


Val 


Leu 


Glu 


Gin 
970 


Lys 


Leu 


He 


Ser 


Glu 
975 


Glu 


Asp 


Leu 


Asn 


Gly 


Glu Gin 


Lys 


Leu 


He 


Ser 


Glu 


Glu 


Asp 


Leu 


Asn 


Gly 








980 








985 










990 




Glu 


Gin 


Lys 


Leu 


He Ser 


Glu 


Glu 


Asp 


Leu 


Asn 


Gly 


Ser 












995 








1000 








1005 







<210> 91 
<211> 2196 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> 1952, 1960 
<223> n = A,T,C or G 

<400> 91 

aaacacactc agcccttgca ctgacctgcc ttctgattgg aggctggttg cttcggataa 60 
tgacctccag gaccccactg ttggttacag cctgtttgta ttattcttac tgcaactcaa 120 
gacacctgca gcagggcgtg agaaaaagta aaagaccagt attttcacat tgccaggtac 180 
cagaaacaca gaagactgac acccgccact taagtggggc cagggctggt gtctgcccat 240 
gttgccatcc tgatgggctg cttgccacaa tgagggatct tcttcaatac atcgcttgct 300 
tctttgcctt tttctctgct gggtttttga ttgtggccac ctggactgac tgttggatgg 360 
tgaatgctga tgactctctg gaggtgagca caaaatgccg aggcctctgg tgggaatgcg 420 
tcacaaatgc ttttgatggg attcgcacct gtgatgagta cgattccata cttgcggagc 4 80 
atcccttgaa gctggtggta actcgagcgt tgatgattac tgcagatatt ctagctgggt 540 
ttggatttct caccctgctc cttggtcttg actgcgtgaa attcctccct gatgagccgt 600 
acattaaagt ccgcatctgc tttgttgctg gagccacgtt actaatagca ggtaccccag 660 
gaatcattgg ctctgtgtgg tatgctgttg atgtgtatgt ggaacgttct actttggttt 720 
tgcacaatat atttcttggt atccaatata aatttggttg gtcctgttgg ctcggaatgg 780 
ctgggtctct gggttgcttt ttggctggag ctgttctgac ctgctgctta tatcttttta 840 
aagatgttgg acctgagaga aactatcctt attccttgag gaaagcctat tcagccgcgg 900 
gtgtttccat ggccaagtca tactcagccc ctcgcacaga gacggccaaa atgtatgctg 960 
tagacacaag ggtgtaaaat gcacgtttca gggtgtgttt gcatatgatt taatcaatca 1020 
gtatggttac attgataaaa tagtaagtca atccaggaac agttatttag aattcatatt 1080 
gaattaaatt aattgctagc ttaatcaaaa tgtttgattc tcctatactt tttctttcta 1140 
ttactcttat attttcccgt cattctctct gctaaccttc caccttatgc acacactttc 1200 
cctatatttt aagataagtc tgctaggatg tagaaatatt tgtttgtgat ttctatatag 1260 
ctattagaga ttatgacata gtaatattaa aatgaaatga tacttaaaca gaaagcaatt 1320 
tccaaagagg ccagggaccc taatctttga agagatgaag aaacttactt ttctccctgg 1380 
cttttggttc actttttgta cttttaacaa gtgggtgaat tatttgataa ttttgaggaa 1440 
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gattattctt 
aacagaaaaa 
ctgccccccc 
actttctacc 
tgctgagata 
tattgattaa 
gcccacacac 
caaatatatt 
cggctgggca 
atcacgaggt 
aaaatacaaa 
tgaggcagga 
attgcactcc 



ttaaattcaa 
ggaaataaca 
acccccaaat 
tcactaacaa 
ttgccaaatg 
acatccaaat 
cactgaatca 
ttgatataaa 
cggtagctcg 
caagagacgg 
aattagccgg 
gaatcgtttg 
agtctggcaa 



actagtatgt 
acttcgtata 
gtcaaaggca 
cataagggat 
attagactac 
actattgtaa 
tcagcaccaa 
taaataaacg 
cgtctgtaat 
agaccatcct 
gcgtggtggc 
aacccaggag 
cagagcgaga 



caatgcctac 
ccagccactg 
aatgctaaat 
ctccatatta 
acaatagttc 
tatactatgt 
gcaatatatt 
ttcacgactt 
cnccgcactn 
ggctaacatg 
ggtgcctgta 
gtggaggttg 
ctccat 



cattactctg 
gtgagagtta 
tgatactgga 
tttcaccact 
aaccagagaa 
taaaattcat 
agacatatgg 
tacttaaaaa 
tgggaggcca 
gtgaaaccct 
gtcccagcta 
cagtgagcgg 



attatattaa 
aagacaagag 
gctcgtggtg 
attctagctt 
tttactcatt 
caattcaagt 
caaaattcaa 
atcaatgttg 
aggcgggtgg 
gtctctacta 
ctcgggaggc 
agatcgcacc 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2196 



<210> 92 

<211> 305 

<212> PRT 

<213> Homo sapiens 



<400> 92 
Met Thr Ser Arg 
1 

Tyr Cys Asn Ser 
20 

Pro Val Phe Ser 
35 

Arg His Leu Ser 
50 

Asp Gly Leu Leu 
65 

Phe Phe Ala Phe 

Asp Cys Trp Met 
100 

Cys Arg Gly Leu 
115 

Arg Thr Cys Asp 
130 

Leu Val Val Thr 
145 

Phe Gly Phe Leu 

Pro Asp Glu Pro 
180 

Thr Leu Leu lie 

195 

Ala Val Asp Val 
210 

Phe Leu Gly lie 
225 

Ala Gly Ser Leu 

Leu Tyr Leu Phe 
260 

Leu Arg Lys Ala 
275 

Ser Ala Pro Arg 
290 

Val 



Thr Pro 
5 

Arg His 

His Cys 

Gly Ala 

Ala Thr 

70 
Phe Ser 
85 

Val Asn 
Trp Trp 

Glu Tyr 

Arg Ala 
150 
Thr Leu 
165 

Tyr lie 
Ala Gly 

Tyr Val 

Gin Tyr 
230 
Gly Cys 
245 

Lys Asp 
Tyr Ser 
Thr Glu 



Leu Leu 

Leu Gin 

Gin Val 
40 

Arg Ala 
55 

Met Arg 

Ala Gly 

Ala Asp 

Glu Cys 
120 
Asp Ser 
135 

Leu Met 
Leu Leu 
Lys Val 

Thr Pro 

200 
Glu Arg 
215 

Lys Phe 

Phe Leu 

Val Gly 

Ala Ala 
280 
Thr Ala 
295 



Val Thr Ala 
10 

Gin Gly Val 
25 

Pro Glu Thr 

Gly Val Cys 

Asp Leu Leu 
75 

Phe Leu lie 

90 

Asp Ser Leu 
105 

Val Thr Asn 

lie Leu Ala 

He Thr Ala 
155 

Gly Leu Asp 

170 
Arg He Cys 
185 

Gly He He 

Ser Thr Leu 

Gly Trp Ser 
235 

Ala Gly Ala 

250 
Pro Glu Arg 
265 

Gly Val Ser 
Lys Met Tyr 



Cys Leu 

Arg Lys 

Gin Lys 

45 
Pro Cys 
60 

Gin Tyr 
Val Ala 
Glu Val 

Ala Phe 

125 
Glu His 
140 

Asp He 
Cys Val 
Phe Val 

Gly Ser 

205 
Val Leu 
220 

Cys Trp 

Val Leu 

Asn Tyr 

Met Ala 
285 
Ala Val. 
300 



Tyr Tyr Ser 
15 

Ser Lys Arg 
30 

Thr Asp Thr 

Cys His Pro 

He Ala Cys 
80 

Thr Trp Thr 
95 

Ser Thr Lys 
110 

Asp Gly He 

Pro Leu Lys 

Leu Ala Gly 
160 

Lys Phe Leu 
175 

Ala Giy Ala 
190 

Val Trp Tyr 

His Asn He 

Leu Gly Met 
240 

Thr Cys Cys 

255 
Pro Tyr Ser 
270 

Lys Ser Tyr 
Asp Thr Arg 
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305 



<210> 93 
<211> 7460 

<212> DNA 

<213> Homo sapiens 
<400> 93 

cgtccgggag aacgctgcag aaattctcat 
gaaaactcag gcagtggatt tcatcaacta 
gtggaagtca atggtggtgt cacatcccca 
ttcagcacag tgcccttttc tgggaccccc 
tgcttgttgt aagactccgt ttaatttcca 
caccaggtag acagcgcaat ctgtggagct 
ttgtggcccc agacttttaa aacagcacta 
atgaaatgaa aaccctgttg ctgcgtcact 
ctgtggggat tcagtttagg cgctggcccg 
acctgtctgc atctgactga gccggctctc 
ggcggcgcca cctgcagtag cgttcgcccg 
tcgcgactcc cgttcctatt caccctggtc 
gtctggacgc agaggctgca cggcggcagc 
gtggtgcagg gcgacccgcg cgagctgcgg 
agccgcgcgg acgagaagcc gctccggagg 
atcaaggtgt acggacaggt tagtctgaat 
gctggagaga aaagcaacgt gatcgtggcc 
cccaagagca gtgatgtgta cgtgtcttac 
gacaagttaa actttggctt gggaaatagg 
agccctgcgg acaacaagcg gtacatcttt 
acgtttgact tctgcaacac tcttcaaggc 
ctcctacaca gtaaggcctc caaccttctc 
cagctgtgga agtcagatga ctttggccag 
tccttttctt ggggaattga tccctatgac 
gaaccctctg gctactccac tgtcttccga 
caggaagtga tccttgagga agtgagagat 
acaaaggtgg tgcatctctt gggcagtgaa 
tttggccgga agcccatgag agcagcccag 
tacatcgcag atgcctccga ggaccaggtg 
accaatttat acatctcaga ggcagagggg 
ctctattaca gcccaggagg ggccggcagt 
ccatttgctg acttccaccg agtggaagga 
aatggttcta tgaatgagga gaacatgaga 
tgggagtttc ttcaggctcc agccttcacg 
tcccagggct gttcccttca tctggctcag 
cggagaatgc ccatcctgtc caaggagtcg 
gtgggaaaga acttggctag caagacaaac 
tggcgagagg cacttcctgg acctcactac 
acggccattg cccagggcat ggaaaccaac 
acctggaaaa cattcatctt ctctgagaag 
cctggggaga agagcactgt cttcaccatc 
tggctgatcc tccaggtcaa tgccacggat 
tacaagctgt ggtcaccatc tgatgagcgg 
gttttcaaac ggcggacccc ccatgccaca 
gtggtcgtgt ccaactgctc ctgcacccgg 
atgagtgaag atttgtcatt agaggtttgt 
tactcccctc ctgtgccttg ccctgtgggt 
aagatttctg gggacacttg tagcggagga 
gtcccctgtc ccctggcaga agagaacgag 
taccgctatg acctggcctc gggagccacc 



cctggccgac ctccacagtg cagatcagtt 60 
tcatgcttcg gatgtcttgg agacctctgg 120 
cttggtggct gaggcatacc gctctctggc 180 
acgcaaacgc ctgaagcaat cctaagatcc 240 
gaagcagcag ccactgttgc tgccactgac 300 
tttactctgt tgtgagggga agagactgca 360 
aataacttgg gggaaacggg gggagggaaa 420 
gtgttccctt tggcctggct gagtttgata 480 
aggatatccc agcggtggta cttcggagac 54 0 
ctggcctcgc gctgcacatt ctctcctggc 600 
aacatggcga cacggagcag caggagggag 660 
gcactgctgc cgcccggagc tctctgcgaa 720 
gcgcccttgc cccaggaccg gggcttcctc 780 
ctgtgggcgc gcggggatgc caggggggcg 840 
aaacggagcg ctgccctgca gcccgagccc 900 
gattcccaca atcagatggt ggtgcactgg 960 
ttggcccgag atagcctggc attggcgagg 1020 
gactatggaa aatcattcaa gaaaatttca 1080 
agtgaagctg ttatcgccca gttctaccac 1140 
gcagacgctt atgcccagta cctctggatc 1200 
ttttccatcc catttcgggc agctgatctc 1260 
ttgggctttg acaggtccca ccccaacaag 1320 
acctggatca tgattcagga acatgtcaag 1380 
aaaccaaata ccatctacat tgaacgacac 1440 
agtacagatt tcttccagtc ccgggaaaac 1500 
tttcagcttc gggacaagta catgtttgct 1560 
cagcagtctt ctgtccagct ctgggtctcc 1620 
tttgtcacaa gacatcctat taatgaatat 1680 
tttgtgtgtg tcagccacag taacaaccgc 1740 
ctgaagttct ccctgtcctt ggagaacgtg 1800 
gacaccttgg tgaggtattt tgcaaatgaa 1860 
ttgcaaggag tctacattgc tactctgatt 1920 
tcggtcatca cctttgacaa agggggaacc 1980 
ggatatggag agaaaatcaa ttgtgagctt 2040 
cgcctcagtc agctcctcaa cctccagctc 2100 
gctccaggcc tcatcatcgc cactggctca 2160 
gtgtacatct ctagcagtgc tggagccagg 2220 
tacacatggg gagaccacgg cggaatcatc 2280 
gagctaaaat acagtaccaa tgaaggggag 2340 
ccagtgtttg tgtatggcct cctcacagaa 2400 
tttggctcga acaaagagaa tgtccacagc 24 60 
gccttgggag ttccctgcac agagaatgac 2520 
gggaatgagt gtttgctggg acacaagact 2580 
tgcttcaatg gagaggactt tgacaggccg 2640 
gaggactatg agtgtgactt cggtttcaag 2700 
gttccagatc cggaattttc tggaaagtca 2760 
tctacttaca ggagaacgag aggctaccgg 2820 
gatgttgaag cgcgactgga aggagagctg 2880 
ttcattctgt atgctgtgag gaaatccatc 2940 
gagcagttgc ctctcaccgg gctacgggca 3000 
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gcagtggccc tggactttga ctatgagcac aactgtttgt attggtccga cctggccttg 3060 
gacgtcatcc agcgcctctg tttgaatgga agcacagggc aagaggtgat catcaattct 3120 
ggcctggaga cagtagaagc tttggctttt gaacccctca gccagctgct ttactgggta 3180 
gatgcaggct tcaaaaagat tgaggtagct aatccagatg gcgacttccg actcacaatc 3240 
gtcaattcct ctgtgcttga tcgtcccagg gctctggtcc tcgtgcccca agagggggtg 3300 
atgttctgga cagactgggg agacctgaag cctgggattt atcggagcaa tatggatggt 3360 
tctgctgcct atcacctggt gtctgaggat gtgaagtggc ccaatggcat ctctgtggac 3420 
gaccagtgga tttactggac ggatgcctac ctggagtgca tagagcggat cacgttcagt 3480 
ggccagcagc gctctgtcat tctggacaac ctcccgcacc cctatgccat tgctgtcttt 3540 
aagaatgaaa tctactggga tgactggtca cagctcagca tattccgagc ttccaaatac 3600 
sgtgggtccc agatggagat tctggcaaac cagctcacgg ggctcatgga catgaagatt 3660 
ttctacaagg ggaagaacac tggaagcaat gcctgtgtgc ccaggccatg cagcctgctg 3720 
tgcctgccca aggccaacaa cagtagaagc tgcaggtgtc cagaggatgt gtccagcagt 3780 
gtgcttccat caggggacct gatgtgtgac tgccctcagg gctatcagct caagaacaat 384 0 
acctgtgtca aagaagagaa cacctgtctt cgcaaccagt atcgctgcag caacgggaac 3900 
tgtatcaaca gcatttggtg gtgtgacttt gacaacgact gtggagacat gagcgatgag 3960 
agaaactgcc ctaccaccat ctgtgacctg gacacccagt ttcgttgcca ggagtctggg 4020 
acttgtatcc cactgtccta taaatgtgac cttgaggatg actgtggaga caacagtgat 4080 
gaaagtcatt gtgaaatgca ccagtgccgg agtgacgagt acaactgcag ttccggcatg 4140 
tgcatccgct cctcctgggt atgtgacggg gacaacgact gcagggactg gtctgatgaa 4200 
gccaactgta ccgccatcta tcacacctgt gaggcctcca acttccagtg ccgaaacggg 4260 
cactgcatcc cccagcggtg ggcgtgtgac ggggatacgg actgccagga tggttccgat 4 320 
gaggatccag tcaactgtga gaagaagtgc aatggattcc gctgcccaaa cggcacttgc 4 380 
atcccatcca gcaaacattg tgatggtctg cgtgattgct ctgatggctc cgatgaacag 4440 
cactgcgagc ccctctgtac gcacttcatg gactttgtgt gtaagaaccg ccagcagtgc 4500 
ctgttccact ccatggtctg tgacggaatc atccagtgcc gcgacgggtc cgatgaggat 4560 
gcggcgtttg caggatgctc ccaagatcct gagttccaca aggtatgtga tgagttcggt 4 620 
ttccagtgtc agaatggagt gtgcatcagt ttgatttgga agtgcgacgg gatggatgat 4 680 
tgcggcgatt attctgatga agccaactgc gaaaacccca cagaagcccc aaactgctcc 4740 
cgctacttcc agtttcggtg tgagaatggc cactgcatcc ccaacagatg gaaatgtgac 4800 
agggagaacg actgtgggga ctggtctgat gagaaggatt gtggagattc acatattctt 4860 
cccttctcga ctcctgggcc ctccacgtgt ctgcccaatt actaccgctg cagcagtggg 4920 
acctgcgtga tggacacctg ggtgtgcgac gggtaccgag attgtgcaga tggctctgac 4 980 
gaggaagcct gccccttgct tgcaaacgtc actgctgcct ccactcccac ccaacttggg 5040 
cgatgtgacc gatttgagtt cgaatgccac caaccgaaga cgtgtattcc caactggaag 5100 
cgctgtgacg gccaccaaga ttgccaggat ggccgggacg aggccaattg ccccacacac 5160 
agcaccttga cttgcatgag cagggagttc cagtgcgagg acggggaggc ctgcattgtg 5220 
ctctcggagc gctgcgacgg cttcctggac tgctcggacg agagcgatga aaaggcctgc 5280 
agtgatgagt tgactgtgta caaagtacag aatcttcagt ggacagctga cttctctggg 534 0 
gatgtgactt tgacctggat gaggcccaaa aaaatgccct ctgcatcttg tgtatataat 5400 
gtctactaca gggtggttgg agagagcata tggaagactc tggagaccca cagcaataag 54 60 
acaaacactg tattaaaagt cttgaaacca gataccacgt atcaggttaa agtacaggtt 5520 
cagtgtctca gcaaggcaca caacaccaat gactttgtga ccctgaggac cccagaggga 5580 
ttgccagatg cccctcgaaa tctccagctg tcactcccca gggaagcaga aggtgtgatt 5640 
gtaggccact gggctcctcc catccacacc catggcctca tccgtgagta cattgtagaa 5700 
tacagcagga gtggttccaa gatgtgggcc tcccagaggg ctgctagtaa ctttacagaa 5760 
atcaagaact tattggtcaa cactctatac accgtcagag tggctgcggt gactagtcgt 5820 
ggaataggaa actggagcga ttctaaatcc attaccacca taaaaggaaa agtgatccca 5880 
ccaccagata tccacattga cagctatggt gaaaattatc taagcttcac cctgaccatg 594 0 
gagagtgata tcaaggtgaa tggctatgtg gtgaaccttt tctgggcatt tgacacccac 6000 
aagcaagaga ggagaacttt gaacttccga ggaagcatat tgtcacacaa agttggcaat 6060 
ctgacagctc atacatccta tgagatttct gcctgggcca agactgactt gggggatagc 6120 
cctctggcat ttgagcatgt tatgaccaga ggggttcgcc cacctgcacc tagcctcaag 6180 
gccaaagcca tcaaccagac tgcagtggaa tgtacctgga ccggcccccg gaatgtggtt 624 0 
tatggtattt tctatgccac gtcctttctt gacctctatc gcaacccgaa gagcttgact 6300 
acttcactcc acaacaagac ggtcattgtc agtaaggatg agcagtattt gtttctggtc 6360 
cgtgtagtgg taccctacca ggggccatcc tctgactacg ttgtagtgaa gatgatcccg 6420 
gacagcaggc ttccaccccg tcacctgcat gtggttcata cgggcaaaac ctccgtggtc 6480 
atcaagtggg aatcaccgta tgactctcct gaccaggact tgttgtatgc aattgcagtc 654 0 
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aaagatctca taagaaagac 
gaatacaccc ttaacaagtt 
aacatgagca aagattccag 
ttaaaaatca taacagaaaa 
gaaaagcatt ttaatgaaag 
atcacagctt accttgggaa 
ggtcataatt acacgttcac 
gagcctgcca tcctgctgta 
gctgccagat ctacggatgt 
agcctggggg tggggtttgc 
accgccttcg ccaacagcca 
gatgacctgg gggaagatga 
cccatggtga tagcctgaaa 
tttgataaag atagttgatg 
ttttatatgg gccaaaaaca 
aactttgtag taatcaactg 

<210> 94 
<211> 2214 
<212> PRT 
<213> Homo sapiens 



<400> 94 



Met 


Ala 


rp -w- 

1 nr 


Arg 


Ser 


Ser 


Arg 


Arg 


Glu 


Ser 


Arg 


Leu 


Pro 


Phe 


Leu 


Phe 


1 


















lU 










lo 




Thr 


Leu 


vax 


I\±3L 


Leu 


Leu 


Pro 


Pro 


biy 


Aia 


Leu 


Cys 


blU 


Val 


Trp 


rri"U. 

Tnr 


































Gin Arg 


Leu 


niS 


ijj.y 


ijxy 


Ser 


TV 1 = 

Ala 


Pro 


lieu 


Pro 


bin 


Asp 


Arg 


Gly 


Pne 


































Leu 


Val 


vai 


bin 


vjly 


TV 

ASp 


Pro 


Arg 


Glu 


Leu 


Arg 


Leu 


Trp 


Ala 


Arg 


Gly 




50 










55 










60 










Asp 


Ala 


Arg 


Gly 


Ala 


Ser 


Arg 


Ala 


Asp 


Glu 


Lys 


Pro 


Leu 


Arg 


Arg 


Lys 


65 










70 










75 










80 


Arg 


Ser 


Ala 


Ala 


Leu 


Gin 


Pro 


Glu 


Pro 


He 


Lys 


Val 


Tyr 


Gly 


Gin 


Val 










85 










90 










95 




Ser 


Leu 


Asn 


Asp 


Ser 


His 


Asn 


Gin 


Met 


Val 


Val 


His 


Trp 


Ala 


Gly 


Glu 








100 










105 










110 






Lys 


Ser 


Asn 


Val 


He 


Val 


Ala 


Leu 


Ala 


Arg 


Asp 


Ser 


Leu 


Ala 


Leu 


Ala 






115 










120 










125 








Arg 


Pro 


Lys 


Ser 


Ser 


Asp 


Val 


Tyr 


Val 


Ser 


Tyr 


Asp 


Tyr 


Gly 


Lys 


Ser 




130 










135 










140 










Phe 


Lys 


Lys 


He 


Ser 


Asp 


Lys 


Leu 


Asn 


Phe 


Gly 


Leu 


Gly 


Asn 


Arg 


Ser 


145 










150 










155 










160 


Glu 


Ala 


Val 


He 


Ala 


Gin 


Phe 


Tyr 


His 


Ser 


Pro 


Ala 


Asp 


Asn 


Lys 


Arg 










165 










170 










175 




Tyr 


He 


Phe 


Ala 


Asp 


Ala 


Tyr 


Ala 


Gin 


Tyr 


Leu 


Trp 


He 


Thr 


Phe 


Asp 








180 










185 










190 






Phe 


Cys 


Asn 


Thr 


Leu 


Gin 


Gly 


Phe 


Ser 


He 


Pro 


Phe 


Arg 


Ala 


Ala 


Asp 






195 










200 










205 








Leu 


Leu 


Leu 


His 


Ser 


Lys 


Ala 


Ser 


Asn 


Leu 


Leu 


Leu 


Gly 


Phe 


Asp 


Arg 




210 










215 










220 










Ser 


His 


Pro 


Asn 


Lys 


Gin 


Leu 


Trp 


Lys 


Ser 


Asp 


Asp 


Phe 


Gly 


Gin 


Thr 


225 










230 










235 










240 


Trp 


He 


Met 


He 


Gin 


Glu 


His 


Val 


Lys 


Ser 


Phe 


Ser 


Trp 


Gly 


He 


Asp 










245 










250 










255 




Pro 


Tyr 


Asp 


Lys 


Pro 


Asn 


Thr 


He 


Tyr 


He 


Glu 


Arg 


His 


Glu 


Pro 


Ser 








260 










265 










270 






Gly Tyr 


Ser 


Thr 


Val 


Phe 


Arg 


Ser 


Thr 


Asp 


Phe 


Phe 


Gin 


Ser 


Arg 


Glu 






275 










280 










285 









tgacaggagc tacaaagtaa aatcccgtaa cagcactgtg 6600 
ggagcctggc gggaaatacc acatcattgt ccaactgggg 6660 
cataaaaatt accacagttt cattatcagc acctgatgcc 6720 
tgatcatgtt cttctgtttt ggaaaagcct ggctttaaag 6780 
caggggctat gagatacaca tgtttgatag tgccatgaat 684 0 
tactactgac aatttcttta aaatttccaa cctgaagatg 6900 
cgtccaagca agatgccttt ttggcaacca ga'tctgtggg 6960 
cgatgagctg gggtctggtg cagatgcatc tgcaacgcag 7020 
tgctgctgtg gtggtgccca tcttattcct gatactgctg 7080. 
catcctgtac acgaagcacc ggaggctgca gagcagcttc 714 0 
ctacagctcc aggctggggt ccgcaatctt ctcctctggg 7200 
tgaagatgcc cctatgataa ctggattttc agatgacgtc 7260 
gagctttcct cactagaaac caaatggtgt aaatatttta 7320 
gtttatttta aaagatgcac tttgagttgc aatatgttat 7380 
aaaaacaaaa aaaaaaaaaa agggcggccg cgaatgaata 7440 

7460 
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Asn 


^1 n 


Glu 


V CIX 


Tl o 

XXc 


Leu 


o±U 


V3±U 


















T \7C 

jjy i3 


Tvr 
xyx 


Met 


Phe 


Ala 


Thr 


T.t/e 
xiy 9 


Va 1 
vox 












JJ-KJ 








^er 

OCX 


Ser 


Val 

vox 


oxu 


T.A11 
XJCU 


T rr» 


Va 1 
vax 




















Ala 


Gin 


Phe 


Val 


Thr 
1 fix 


Arg 


Hi 
nx o 








340 










Asp 


Ala 


Ser 


Glu 


Asp 


OXll 


Val 


Phe 
















o ou 


AlTQ 


Thr 


Asn 


Leu 


Tvr 
1 yx 


lie 


Ser 


Glu 




370 














Ser 


Leu 


ox u 




Va 1 
V a X 




Tyr 


lyr 


JO 
















1 fix 


XJCti 


V d X 




lyr 


Pho 
rne 


ai a 

MXo 


Asn 










4 (TS 








V Ct J. 


Glu 


Gly 


j-ieu 


ox 1 1 


ox y 


V Ci X 


i yr 








4 ^ u 










rie L 




oX u 


ni n 
o J. u 


Asn 


lYie L 


Arg 


Ser 






H.JJ 








4 4 0 


Thr 

X 111. 


Trn 


ox u 


Ph«» 

IT IIXS 


XiC u 


fil n 

OXil 


ai a 

riXcl 


trx O 




4 J V/ 














Tip 
X X c:; 


Asn 




ox u 


Leu 


OCX 


Cl fi 

oxn 


oxy 


*s D J 










*i f SJ 






Leu 


Ser 


Gin 


Leu 


Leu 


noil 


Leu 


OXll 










4 R t^i 

*1 O vJ 








Lys 


Glu 


Ser 


a 1 a 


fro 


oxy 


Leu 


-L xe 




















Leu 


Ala 


OCX 


Lys 


Thr 
1 IlX 


Asn 








515 












Apq 


Trp 


Arg 


ox u 


AX a 


Leu 


It X O 


Cl \/ 
oxy 




530 










R T R 




Ml c: 
n X o 


Gly Gly 


XX6 


Tl <3 

X xe 


Th y 
1 fix 


a 1 a 


T 1 a 

1 j.e 


















Leu 


Lys 


Tyr 


Q A r* 

0€x 


Thy 

i nx 


Asn 


Pill 
oJ.U 


o±y 


















Ser 


Glu 


Lys 


Pro 


Va 1 

V Cl X 


Phe 


Va 1 

vox 


i yr 








yJOKJ 










Lys 


Ser 


Thr 


Val 
vox 


Ph^ 


Thr 

1. 111. 


T 1 

X Xc 


Pho 






595 










ouu 


OCX 


Trp 


Leu 


Tl #a 
X xc 


XicU 


OXil 


Va 1 


Asn 




610 














Cys 


Thr 


Glu 


Asn 


Asp 


Tvr 


Lys 




















Asn 


Glu 


Cys 


XJCU 




f^l V 

oxy 


nxd 


Lys 










64 5 








His 


Ala 


Thr 


Cys 


Phe 


Asn 


Glv 

oxy 


Glu 








660 










Ser 


Asn 


Cys 


Ser 


Cys 


Thr 


Arg 


Glu 






675 










680 


Lys 


Met 


Ser 


Glu 


Asp 


Leu 


Ser 


Leu 




690 










695 




Phe 


Ser 


Gly 


Lys 


Ser 


Tyr 


Ser 


Pro 



705 710 
Thr Tyr Arg Arg Thr Arg Gly Tyr 
725 

Ser Gly Gly Asp Val Glu Ala Arg 
740 

Pro Leu Ala Glu Glu Asn Glu Phe 



Vax 


Arg Asp 


Phe 


bin 


Leu Arg Asp 








300 






vai 


His 


Leu 


Leu 


^1 mm 

Gly 


Ser Glu Gin 






315 






320 


Ser 


Phe Gly Arg 


iiys 


Pro Met Arg 




330 








335 


rr O 


He 


Asn 


Glu 


Tyr 


Tyr He Ala 


OHO 










350 


vajL 


Cys 


Val 


Ser 


His 


Ser Asn Asn 










365 




/\xa 


Glu Gly 


Leu 


Lys 


Phe Ser Leu 








380 






S^r 

OCX 


Pro Gly Gly 


Ala Gly Ser Asp 






395 






400 


oXU 


Pro 


Phe 


Ala 


Asp 


Phe His Arg 




410 








415 


T 1 

1 ±e 


Ala 


Thr 


Leu 


He 


Asn Gly Ser 


yl '3 










430 


val 


He 


Thr 


Phe 


Asp Lys Gly Gly 










445 




n 1 =1 

A J. a 


Phe 


Thr 


Gly 


Tyr Gly Glu Lys 








460 






Cys 


Ser 


Leu 


His 


Leu 


Ala Gin Arg 






475 






480 


Leu 


Arg Arg 


Met 


Pro 


He Leu Ser 




490 








495 


T 1 

lie 


Ala 


Thr 


Gly 


Ser 


Val Gly Lys 


R n Q 
Duo 










510 


Tyr 


He 


Ser 


Ser 


Ser 


Ala Gly Ala 










525 




D A 

rro 


His 


Tyr 


Tyr 


Thr 


Trp Gly Asp 








540 






Ala 


Gin 


Gly Met 


Glu 


Thr Asn Glu- 






555 






560 


blU 


Thr 


Trp 


Lys 


Thr 


Phe He Phe 




570 








575 


o± y 


Leu 


Leu 


Thr 


Glu 


Pro Gly Glu 


R Q c: 
00 0 










590 


r*! \7 
Liiy 


Ser 


Asn 


Lys 


Glu 


Asn Val His 










605 




Ala 


Thr Asp Ala 


Leu Gly Val Pro 








620 






Trp 


Ser 


Pro 


Ser 


Asp Glu Arg Gly 






635 






640 


i nr 


Val 


Phe 


Lys 


Arg Arg Thr Pro 




650 








655 


Asp 


Phe 


Asp 


Arg 


Pro 


Val Val Val 


D 0 0 










670 


a 

Mop 


Tyr 


Glu 


Cys 


Asp Phe Gly Phe 










685 




oXU 


Val 


Cys 


Val 


Pro Asp Pro Glu 








700 






Pro 


Val 


Pro 


Cys 


Pro Val Gly Ser 






715 






720 


Arg 


Lys 


He 


Ser 


Gly Asp Thr Cys 




730 








735 


Leu 


Glu Gly Glu 


Leu 


Val Pro Cys 


745 










750 


He 


Leu Tyr Ala 


Val 


Arg Lys Ser 
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755 






760 










765 








lie 


Tyr 


Arg 


Tyr 


Asp Leu 


Ala Ser 


Gly 


Ala 


Thr 


Glu 


Gin 


Leu 


Pro 


Leu 




770 








775 








780 










Thr 


Gly 


Leu 


Arg 


Ala Ala 


Val Ala 


Leu 


Asp 


Phe 


Asp Tyr 


Glu 


His 


Asn 


785 








790 








795 










800 


Cys 


Leu 


Tyr 


Trp 


Ser Asp 


Leu Ala 


Leu 


Asp 


Val 


He 


Gin 


Arg 


Leu 


Cys 










805 






810 










815 




Leu 


Asn 


Gly 


.oer 


Thr Gly Gin Glu 


Val 


He 


He 


Asn 


Ser 


Gly 


Leu 


Glu 








820 






825 










830 






Tnr 


va± 


uXU 


& 1 -a 

Ala 


Leu Ala 


Phe Glu 


Pro 


Leu 


Ser 


Gin 


Leu 


Leu 


Tyr 


Trp 






o o c 






840 










845 




va i 


Asp 


Ala 


biy 


Phe Lys 


Lys He 


Glu 


Val 


Ala 


Asn 


Pro 


Asp 


Gly 


Asp 




a c r\ 








855 








860 










Pne 


Arg 


Leu 


Thr 


lie Val 


Asn Ser 


Ser 


Val 


Leu 


Asp Arg 


Pro 


Arg 


Ala 


C% C 

865 








870 








875 










880 


Leu 


val 


T A. « 

Leu 


Val 


Pro Gin 


Glu Gly 


Val 


Met 


Phe 


Trp 


Thr 


Asp 


Trp 


Gly 










885 






890 










895 




Asp 


Leu 


Lys 


Pro 


Gly He 


Tyr Arg 


Ser 


Asn 


Met 


Asp Gly 


Ser 


Ala 


Ala 








900 






905 










910 






Tyr 


His 


Leu 


Val 


Ser Glu 


Asp Val 


Lys 


Trp 


Pro 


Asn 


Gly 


He 


Ser 


Val 






915 






920 










925 








Asp 


Asp 


Gin 


Trp 


He Tyr 


Trp Thr 


Asp 


Ala 


Tyr 


Leu 


Glu 


Cys 


He 


Glu 




930 








935 








940 










Arg 


lie 


Thr 


Phe 


Ser Gly Gin Gin 


Arg 


Ser 


Val 


He 


Leu 


Asp 


Asn 


Leu 


945 








950 








955 










960 


Pro 


His 


Pro 


Tyr 


Ala He 


Ala Val 


Phe 


Lys 


Asn 


Glu 


He 


Tyr 


Trp 


Asp 










965 






970 










975 




Asp 


Trp 


Ser 


Gin 


Leu Ser 


He Phe 


Arg 


Ala 


Ser 


Lys 


Tyr 


Ser 


Gly 


Ser 








980 






985 










990 






Gin 


Met 


Glu 


lie 


Leu Ala 


Asn Gin 


Leu 


Thr 


Gly 


Leu 


Met 


Asp 


Met 


Lys 






995 






1000 








1005 






lie 


Phe 


Tyr 


Lys 


Gly Lys 


Asn Thr 


Gly 


Ser 


Asn 


Ala 


Cys 


Val 


Pro 


Arg 




1010 






1015 








1020 








Pro 


Cys 


Ser 


Leu 


Leu Cys 


Leu Pro 


Lys 


Ala 


Asn 


Asn 


Ser Arg 


Ser 


Cys 


1025 






1030 






1035 








1040 


Arg Cys 


Pro 


Glu 


Asp Val 


Ser Ser 


Ser 


Val 


Leu 


Pro 


Ser 


Gly 


Asp 


Leu 










1045 






1050 








1055 


Met 


Cys 


Asp 


Cys 


Pro Gin Gly Tyr Gin 


Leu 


Lys 


Asn 


Asn 


Thr 


Cys 


Val 








1060 




1065 








1070 




Lys 


Glu 


Glu 


Asn 


Thr Cys 


Leu Arg 


Asn 


Gin 


Tyr 


Arg 


Cys 


Ser 


Asn Gly 



1075 1080 1085 



Asn Cys He Asn Ser He Trp Trp Cys Asp Phe Asp Asn Asp Cys Gly 

1090 1095 1100 

Asp Met Ser Asp Glu Arg Asn Cys Pro Thr Thr He Cys Asp Leu Asp 
1105 1110 1115 1120 

Thr Gin Phe Arg Cys Gin Glu Ser Gly Thr Cys He Pro Leu Ser Tyr 

1125 1130 1135 

Lys Cys Asp Leu Glu Asp Asp Cys Gly Asp Asn Ser Asp Glu Ser His 

1140 1145 1150 

Cys Glu Met His Gin Cys Arg Ser Asp Glu Tyr Asn Cys Ser Ser Gly 

1155 1160 1165 

Met Cys He Arg Ser Ser Trp Val Cys Asp Gly Asp Asn Asp Cys Arg 

1170 1175 1180 

Asp Trp Ser Asp Glu Ala Asn Cys Thr Ala He Tyr His Thr Cys Glu 
1185 1190 1195 1200 

Ala Ser Asn Phe Gin Cys Arg Asn Gly His Cys He Pro Gin Arg Trp 

1205 1210 1215 

Ala Cys Asp Gly Asp Thr Asp Cys Gin Asp Gly Ser Asp Glu Asp Pro 
1220 1225 1230 
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Val Asn Cys Glu Lys Lys Cys Asn Gly Phe Arg Cys Pro Asn Gly Thr 

1235 1240 1245 

Cys He Pro Ser Ser Lys His Cys Asp Gly Leu Arg Asp Cys Ser Asp 

1250 . 1255 1260 

Gly Ser Asp Glu Gin His Cys Glu Pro Leu Cys Thr His Phe Met Asp 
1265 1270 • 1275 1280 

Phe Val Cys Lys Asn Arg Gin Gin Cys Leu Phe His Ser Met Val Cys 

1285 1290 1295 

Asp Gly He He Gin Cys Arg Asp Gly Ser Asp Glu Asp Ala Ala Phe 

1300 1305 1310 

Ala Gly Cys Ser Gin Asp Pro Glu Phe His Lys Val Cys Asp Glu Phe 

1315 1320 1325 

Gly Phe Gin Cys Gin Asn Gly Val Cys He Ser Leu He Trp Lys Cys 

1330 1335 1340 

Asp Gly Met Asp Asp Cys Gly Asp Tyr Ser Asp Glu Ala Asn Cys Glu 
1345 1350 1355 1360 

Asn Pro Thr Glu Ala Pro Asn Cys Ser Arg Tyr Phe Gin Phe Arg Cys 

1365 1370 1375 

Glu Asn Gly His Cys He Pro Asn Arg Trp Lys Cys Asp Arg Glu Asn 

1380 1385 1390 

Asp Cys Gly Asp Trp Ser Asp Glu Lys Asp Cys Gly Asp Ser His He 

1395 1400 1405 

Leu Pro Phe Ser Thr Pro Gly Pro Ser Thr Cys Leu Pro Asn Tyr Tyr 

1410 1415 1420 

Arg Cys Ser Ser Gly Thr Cys Val Met Asp Thr Trp Val Cys Asp Gly 
1425 1430 1435 1440 

Tyr Arg Asp Cys Ala Asp Gly Ser Asp Glu Glu Ala Cys Pro Leu Leu 

1445 1450 1455 

Ala Asn Val Thr Ala Ala Ser Thr Pro Thr Gin Leu Gly Arg Cys Asp 

1460 1465 1470 

Arg Phe Glu Phe Glu Cys His Gin Pro Lys Thr Cys He Pro Asn Trp 

1475 1480 ' 1485 

Lys Arg Cys Asp Gly His Gin Asp Cys Gin Asp Gly Arg Asp Glu Ala 

1490 1495 1500 

Asn Cys Pro Thr His Ser Thr Leu Thr Cys Met Ser Arg Glu Phe Gin 
1505 1510 1515 1520 

Cys Glu Asp Gly Glu Ala Cys He Val Leu Ser Glu Arg Cys Asp Gly 

1525 ■ 1530 1535 

Phe Leu Asp Cys Ser Asp Glu Ser Asp Glu Lys Ala Cys Ser Asp Glu 

1540 . 1545 1550 

Leu Thr Val Tyr Lys Val Gin Asn Leu Gin Trp Thr Ala Asp Phe Ser 

1555 1560 1565 

Gly Asp Val Thr Leu Thr Trp Met Arg Pro Lys Lys Met Pro Ser Ala 

1570 1575 1580 

Ser Cys Val Tyr Asn Val Tyr Tyr Arg Val Val Gly Glu Ser He Trp 
1585 1590 1595 1600 

Lys Thr Leu Glu Thr His Ser Asn Lys Thr Asn Thr Val Leu Lys Val 

1605 1610 1615 

Leu Lys Pro Asp Thr Thr Tyr Gin Val Lys Val Gin Val Gin Cys Leu 

1620 1625 1630 

Ser Lys Ala His Asn Thr Asn Asp Phe Val Thr Leu Arg Thr Pro Glu 

1635 1640 1645 

Gly Leu Pro Asp Ala Pro Arg Asn Leu Gin Leu Ser Leu Pro Arg Glu 

1650 1655 1660 

Ala Glu Gly Val He Val Gly His Trp Ala Pro Pro He His Thr His 
1665 1670 1675 1680 

Gly Leu He Arg Glu Tyr He Val Glu Tyr Ser Arg Ser Gly Ser Lys 

1685 1690 1695 

Met Trp Ala Ser Gin Arg Ala Ala Ser Asn Phe Thr Glu He Lys Asn 
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1700 1705 1710 

Leu Leu Val Asn Thr Leu Tyr Thr Val Arg Val Ala Ala Val Thr Ser 

1715 . 1720 1725 

Arg Gly lie Gly Asn Trp Ser Asp Ser Lys Ser lie Thr Thr lie Lys 

1730 1735 1740 

Gly Lys Val lie Pro Pro Pro Asp lie His lie Asp Ser Tyr Gly Glu 
1745 1750 1755 1760 

Asn Tyr Leu Ser Phe Thr Leu Thr Met Glu Ser Asp lie Lys Val Asn 

1765 1770 1775 

Gly Tyr Val Val Asn Leu Phe Trp Ala Phe Asp Thr His Lys Gin Glu 

1780 • 1785 1790 

Arg Arg Thr Leu Asn Phe Arg Gly Ser He Leu Ser His Lys Val Gly 

1795 1800 1805 

Asn Leu Thr Ala His Thr Ser Tyr Glu He Ser Ala Trp Ala Lys Thr 

1810 1815 1820 

Asp Leu Gly Asp Ser Pro Leu Ala Phe Glu His Val Met Thr Arg Gly 
1825 1830 1835 1840 

Val Arg Pro Pro Ala Pro Ser Leu Lys Ala Lys Ala He Asn Gin Thr 

1845 1850 1855 

Ala Val Glu Cys Thr Trp Thr Gly Pro Arg Asn Val Val Tyr Gly He 

1860 1865 1870 

Phe Tyr Ala Thr Ser Phe Leu Asp Leu Tyr Arg Asn Pro Lys Ser Leu 

1875 1880 1885 

Thr Thr Ser Leu His Asn Lys Thr Val He Val Ser Lys Asp Glu Gin 

1890 1895 1900 

Tyr Leu Phe Leu Val Arg Val Val Val Pro Tyr Gin Gly Pro Ser Ser 
1905 1910 1915 1920 

Asp Tyr Val Val Val Lys Met He Pro Asp Ser Arg Leu Pro Pro Arg 

1925 1930 1935 

His Leu His Val Val His Thr Gly Lys Thr Ser Val Val He Lys Trp 

1940 1945 1950 

Glu Ser Pro Tyr Asp Ser Pro Asp Gin Asp Leu Leu Tyr Ala He Ala 

1955 1960 1965 

Val Lys Asp Leu He Arg Lys Thr Asp Arg Ser Tyr Lys Val Lys Ser 

1970 1975 1980 

Arg Asn Ser Thr Val Glu Tyr Thr Leu Asn Lys Leu Glu Pro Gly Gly 
1985 1990 1995 2000 

Lys Tyr His He He Val Gin Leu Gly Asn Met Ser Lys Asp Ser Ser 

2005 2010 2015 

He Lys He Thr Thr Val Ser Leu Ser Ala Pro Asp Ala Leu Lys He 

2020 2025 2030 

He Thr Glu Asn Asp His Val Leu Leu Phe Trp Lys Ser Leu Ala Leu 

2035 2040 2045 

Lys Glu Lys His Phe Asn Glu Ser Arg Gly Tyr Glu He His Met Phe 

2050 2055 2060 

Asp Ser Ala Met Asn He Thr Ala Tyr Leu Gly Asn Thr Thr Asp Asn 
2065 2070 2075 2080 

Phe Phe Lys He Ser Asn Leu Lys Met Gly His Asn Tyr Thr Phe Thr 

2085 2090 2095 

Val Gin Ala Arg Cys Leu Phe Gly Asn Gin He Cys Gly Glu Pro Ala 

2100 2105 2110 

He Leu Leu Tyr Asp Glu Leu Gly Ser Gly Ala Asp Ala Ser Ala Thr 

2115 2120 2125 

Gin Ala Ala Arg Ser Thr Asp Val Ala Ala Val Val Val Pro He Leu 

2130 2135 2140 

Phe Leu He Leu Leu Ser Leu Gly Val Gly Phe Ala He Leu Tyr Thr 
2145 2150 2155 2160 

Lys His Arg Arg Leu Gin Ser Ser Phe Thr Ala Phe Ala Asn Ser His 
2165 2170 2175 
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Tyr Ser Ser Arg Leu Gly Ser Ala lie Phe Ser Ser Gly Asp Asp Leu 

2180 2185 2190 

Gly Glu Asp Asp Glu Asp Ala Pro Met lie Thr Gly Phe Ser Asp Asp 

2195 2200 . 2205 

Val Pro Met Val He Ala 
2210 



<210> 95 

<211> 5980 

<212> DNA 

<213> Homo sapiens 

<400> 95 

gccatgctgt catgagaaag tggcttcttc tcaatgaccc ggaagatacc agttcaggtt 60 
ctaaaggtta tatgaaagtc agcatgtttg tcctgggaac cggagatgag cctcctcctg 120 
agagacgaga tcgtgataat gacagtgatg atgtggagag taatttgtta ctccctgctg 180 
gcattgccct gggaaccgga gatgagcctc ctcctgagag acgagatcgt gataatgaca 240 
gtgatgatgt ggagagtaat ttgttactcc ctgctggcat tgccctccgg tgggtgacct 300 
tcttgctgaa aatctaccga gctgaggaca tcccccagat ggatgatgcc ttctcacaga 360 
cagtaaagga aatatttgga ggcaatgcag ataagaaaaa tctcgtggat ccttttgtag 420 
aagtttcctt tgctggaaaa aaggtttgta caaacataat tgagaaaaat gcaaacecag 480 
agtggaatca ggtcgtcaat cttcagatca agtttccttc agtgtgtgaa aaaataaaac 540 
taacaatata tgactgggac cgtcttacta aaaatgatgt agttggaaca acatatctac 600 
acctctctaa aattgctgcc tctggtgggg aagtggaaga tttctcatct tcgggaactg 660 
gggctgcatc atatacagta aacacaggag aaacagaggt aggctttgtt ccaacgtttg 720 
gaccttgtta cctgaatctt tatggaagcc ccagagagta cacgggattc ccagacccct 780 
atgatgagct gaatactgga aagggggaag gagttgccta cagaggcagg atcttggttg 84 0 
aattagccac ttttcttgag aagacaccac cagataaaaa gcttgagccc atttcaaatg 900 
atgacctgct ggttgttgag aaataccagc gaaggcggaa gtacagcctg tctgccgtgt 960 
ttcattcagc caccatgttg caagatgttg gtgaggccat tcagtttgaa gtcagcattg 1020 
ggaactatgg caacaagttt gacaccacct gtaagccttt ggcatcaaca actcagtaca 1080 
gccgtgctgt atttgatggc aactactatt attacttgcc ttgggcccac accaagccag 1140 
ttgttaccct gacttcatac tgggaggata ttagtcatcg cctggatgcg gtgaacactc 1200 
tcctagctat ggcagaacgg ctgcaaacaa atatagaagc tctaaaatca gggatacaag 1260 
gtaaaattcc tgcaaaccag ctggctgaat tgtggctgaa gctgatagat gaagttatag 1320 
aagacacgag atacacgttg cctctcacag aaggaaaagc caacgtcaca gttctcgata 1380 
ctcagatccg aaagctgcgg tccaggtctc tctcccaaat acatgaggcg gctgtgagga 1440 
tgaggtcgga agccacagat gtgaagtcca cactggcaga aattgaggac tggcttgata 1500 
aattaatgca gctgactgaa gagccacaga acagcatgcc tgacatcatc atctggatga 1560 
tccggggaga gaagagactg gcctatgcac gaattcccgc acatcaggtc ttgtactcca 1620 
ccagtggtga gaatgcatct ggaaaatact gtgggaaaac ccaaaccatc tttctgaagt 1680 
atccacagga gaaaaacaac gggccaaagg tgcctgtgga gttgcgagtg aacatctggc 1740 
taggcttaag tgctgtggag aagaagttta acagcttcgc agaaggaact ttcaccgtct 1800 
ttgctgaaat gtatgaaaat caagctctca tgtttggaaa atggggtact tctggattag 1860 
taggacgtca taagttttct gatgtcacag gaaaaataaa actcaagagg gaattttttc 1920 
tgcctccaaa aggctgggaa tgggaaggag agtggatagt tgatcctgaa agaagcttgc 1980 
tgactgaggc agatgcaggt cacacggagt tcactgatga agtctatcag aacgagagcc 2040 
gctaccccgg gggcgactgg aagccggccg aggacaccta cacggatgcg aacggcgata 2100 
aagcagcatc acccagcgag ttgacttgtc ctccaggttg ggaatgggaa gatgatgcat 2160 
ggtcttatga cataaatcga gcggtggatg agaaaggctg ggaatatgga atcaccattc 2220 
ctcctgatca taagcccaaa tcctgggttg cagcagagaa aatgtaccac actcatagac 2280 
ggcgaaggct ggtccgaaaa cgcaagaaag atttaacaca gactgcttca agcaccgcaa 234 0 
gggccatgga ggaattgcaa gaccaagagg gctgggaata tgcttctcta attggctgga 24 OO' 
aatttcactg gaaacaacgt agttcagata ccttccgccg cagacgctgg aggagaaaaa 24 60 
tggctccttc agaaacacat ggtgcagctg. ccatctttaa . acttgaaggt gcccttgggg 2520 
cagacactac cgaagatggg gatgagaaga gcctggagaa acagaagcac agtgccacca 2580 
ctgtgttcgg agcaaacacc cccattgttt cctgcaattt tgacagagtc tacatctacc 2640 
atctgcgctg ctatgtctat caagccagaa acctcttggc tttagataag gatagctttt 2700 
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cagatccata tgctcatatc tgtttcctcc atcggagcaa aaccactgag atcatccatt 2760 
caaccctgaa tcccacgtgg gaccaaacaa ttatattcga tgaagttgaa atctatgggg 2820 
aaccccaaac agttctacag aatccaccca aagttatcat ggaacttttt gacaatgacc 2880 
aagtgggcaa agatgaattt ttaggacgaa gcattttctc tcctgtggtg aaactgaact 2940 
cagaaatgga catcacaccc aaacttctct ggcacccagt aatgaatgga gacaaagcct 3000 
gcggggatgt tcttgtaact gcagagctga ttctgagggg caaggatggc tccaaccttc 3060 
ccattcttcc ccctcaaagg gcgccaaatc tatacatggt cccccagggg atcaggcctg 3120 
tggtccagct cactgccatt gagattctag cttggggctt aagaaatatg aaaaacttcc 3180 
agatggcttc tatcacatcc cccagtcttg ttgtggagtg tggaggagaa agggtggaat 3240 
cggtggtgat caaaaacctt aagaagacac ccaactttcc aagttctgtt ctcttcatga 3300 
aagtgttctt gcccaaggag gaattgtaca tgcccccact ggtgatcaag gtcatcgacc 3360 
acaggcagtt tgggcggaag cctgtcgtcg gccagtgcac catcgagcgc ctggaccgct 3420 
ttcgctgtga cccttatgca gggaaagagg acatcgtccc acagctcaaa gcctcccttc 3480 
tgtctgcccc accatgccgg gacatcgtta tcgaaatgga agacaccaaa ccattactgg 354 0 
cttctaagct gacagaaaag gaggaagaaa tcgtggactg gtggagtaaa ttttatgctt 3600 
cctcagggga acatgaaaaa tgcggacagt atattcagaa aggctattcc aagctcaaga 3660 
tatataattg cgaactagaa aatgtagcag aatttgaggg cctgacagac ttctcagata 3720 
cgttcaagtt gtaccgaggc aagtcggatg aaaatgaaga tccttctgtg gttggagagt 3780 
ttaagggctc ctttcggatc taccctctgc cggatgaccc cagcgtgcca gcccctccca 3840 
gacagtttcg ggaattacct gacagcgtcc cacaggaatg cacggttagg atttacattg 3900 
ttcgaggctt agagctccag ccccaggaca acaatggcct gtgtgaccct tacataaaaa 3960 
taacactggg caaaaaagtc attgaagacc gagatcacta cattcccaac actctcaacc 4020 
cagtctttgg caggatgtac gaactgagct gctacttacc tcaagaaaaa gacctgaaaa 4080 
tttctgtcta tgattatgac acctttaccc gggatgaaaa agtaggagaa acaattattg 4140 
atctggaaaa ccgattcctt tcccgctttg ggtcccactg cggcatacca gaggagtact 4200 
gtgtttctgg agtcaatacc tggcgagatc aactgagacc aacacagctg cttcaaaatg 4260 
tcgccagatt caaaggcttc ccacaaccca tcctttccga agatgggagt agaatcagat 4 320 
atggaggacg agactacagc ttggatgaat ttgaagccaa caaaatcctg caccagcacc 4380 
tcggggcccc tgaagagcgg cttgctcttc acatcctcag gactcagggg ctggtccctg 4440 
agcacgtgga aacaaggact ttgcacagca ccttccagcc caacatttcc cagggaaaac 4500 
ttcagatgtg ggtggatgtt ttccccaaga gtttggggcc accaggccct cctttcaaca 4560 
tcacaccccg gaaagccaag aaatactacc tgcgtgtgat catctggaac accaaggacg 4 620 
ttatcttgga cgagaaaagc atcacaggag aggaaatgag tgacatctac gtcaaaggct 4 680 
ggattcctgg caatgaagaa aacaaacaga aaacagatgt ccattacaga tctttggatg 4740 
gtgaagggaa ttttaactgg cgatttgttt tcccgtttga ctaccttcca gccgaacaac 4800 
tctgtatcgt tgcgaaaaaa gagcatttct ggagtattga ccaaacggaa tttcgaatcc 4860 
cacccaggct gatcattcag atatgggaca atgacaagtt ttctctggat gactacttgg 4 920 
gtttcctaga acttgacttg cgtcacacga tcattcctgc aaaatcacca gagaaatgca 4 980 
ggttggacat gattccggac ctcaaagcca tgaaccccct taaagccaag acagcctccc 5040 
tctttgagca gaagtccatg aaaggatggt ggccatgcta cgcagagaaa gatggcgccc 5100 
gcgtaatggc tgggaaagtg gagatgacat tggaaatcct caacgagaag gaggccgacg 5160 
agaggccagc cgggaagggg cgggacgaac ccaacatgaa ccccaagctg gacttaccaa 5220 
atcgaccaga aacctccttc ctctggttca ccaacccatg caagaccatg aagttcatcg 5280 
tgtggcgccg ctttaagtgg gtcatcatcg gcttgctgtt cctgcttatc ctgctgctct 5340 
tcgtggccgt gctcctctac tctttgccga actatttgtc aatgaagatt gtaaagccaa 5400 
atgtgtaaca aaggcaaagg cttcatttca agagtcatcc agcaatgaga gaatcctgcc 54 60 
tctgtagacc aacatccagt gtgattttgt gtctgagacc acaccccagt agcaggttac 5520 
gccatgtcac cgagccccat tgattcccag aggktcttag tcctgggaaa gtcaggccaa 5580 
caagcaacgt ttgcatcatg ttatctctta agtattaaaa gttttatttt ctaaagttta 5640 
aatcatggtt tttymaaata tttttcaagg tggctggttc catttaaaaa tcatcttttt 5700 
atatgtgtct tcggttctag acttcagctt ttggaaattg ctaaatagaa ttcaaaaatc 5760 
tctgcatcct gaggtgatat acttcatatt tgtaatcaac tgaaagagct gtgcattata 5820 
aaatcagtta gaatagttag aacaattctt atttatgccc acaaccattg ctatattttg 5880 
tatggatgtc ataaaagtct atttaacctc tgtaatgaaa ctaaataaaa atgtttcacc 5940 
tttaaagaca aaaaaaaaaa gattgagagg acggccgatc 5980 

<210> 96 
<211> 1798 
<212> PRT 
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<213> Homo sapiens 



<400> 96 



Met 


Arg 


Lys 


Trp 


Leu 


1 








5 


Ser 


Lys 


Gly 


Tyr 


Met 








20 




Glu 


Pro 


Pro 


Pro 


Glu 






35 






Glu 


Ser 


Asn 


Leu 


Leu 




50 








Glu 


Pro 


Pro 


Pro 


Glu 


65 










Glu 


Ser 


Asn 


Leu 


Leu 










85 


Phe 


Leu 


Leu 


Lys 


He 








100 




Ala 


Phe 


Ser 


Gin 


Thr 






115 






Lys 


Asn 


Leu 


Val 


Asp 




130 








Val 


Cys 


Thr 


Asn 


He 


145 










Val 


Val 


Asn 


Leu 


Gin 










165 


Leu 


Thr 


He 


Tyr 


Asp 








180 




Thr 


Thr 


Tyr 


Leu 


His 






195 






Glu 


Asp 


Phe 


Ser 


Ser 




210 








Thr 


Gly 


Glu 


Thr 


Glu 


225 










Leu 


Asn 


Leu 


Tyr 


Gly 










245 


Tyr 


Asp 


Glu 


Leu 


Asn 








260 




Arg 


He 


Leu 


Val 


Glu 






275 






Lys 


Lys 


Leu 


Glu 


Pro 




290 








Tyr 


Gin 


Arg 


Arg 


Arg 


305 










Thr 


Met 


Leu 


Gin 


Asp 










325 


Gly Asn 


Tyr 


Gly 


Asn 








J4 U 




Thr 


Thr 


Gin 


Tyr 


Ser 






c 






Leu 


Pro 


Trp 


Ala 


His 




370 








Glu 


Asp 


He 


Ser 


His 


385 










Ala 


Glu 


Arg 


Leu 


Gin 










405 


Gly Lys 


He 


Pro 


Ala 








420 




Asp Glu 


Val 


He 


Glu 



435 



Leu 


Leu Asn 


Asp 


Pro 


Glu 


Asp 


Thr 


Ser 


Ser Gly 








10 










15 


Lys 


Val Ser 


Met 


Phe 


val 


Leu 


Gly 


Thr Gly Asp 






25 










30 




Arg Arg Asp Arg Asp 


Asn 


Asp 


Ser 


Asp Asp Val 




40 










45 






Leu 


Pro Ala 


Gly He 


Ala 


Leu 


Gly 


Thr Gly Asp 




55 ■ 








60 








Arg Arg Asp Arg 


Asp 


Asn 


Asp 


Ser 


Asp Asp Val 


70 








75 








80 


Leu 


Pro Ala 


Gly He 


Ala 


Leu Arg 


Trp 


Val Thr 








90 










95 


Tyr Arg Ala 


Glu 


Asp 


He 


Pro 


Gin 


Met 


Asp Asp 






105 










110 




Val 


Lys Glu 


He 


Phe- 


Gly Gly Asn 


Ala 


Asp Lys 




120 










125 






Pro 


Phe Val 


Glu 


Val 


Ser 


Phe 


Ala 


Gly 


Lys Lys 




135 








140 








He 


Glu Lys 


Asn 


Ala 


Asn 


Pro 


Glu 


Trp Asn Gin 


150 








155 








160 


He 


Lys Phe 


Pro 


Ser 


Val 


Cys 


Glu 


Lys 


He Lys 








170 










175 


Trp Asp Arg 


Leu 


Thr 


Lys 


Asn 


Asp 


Val 


Val Gly 






185 










190 




Leu 


Ser Lys 


He 


Ala 


Ala 


Ser 


Gly 


Gly 


Glu Val 




200 










205 






Ser Gly Thr Gly Ala 


Ala 


Ser 


Tyr 


Thr 


Val Asn 




215 








220 








Val 


Gly Phe 


Val 


Pro 


Thr 


Phe 


Gly 


Pro 


Cys Tyr 


230 








235 








240 


Ser 


Pro Arg 


Glu 


Tyr 


Thr 


Gly 


Phe 


Pro Asp Pro 








250 










255 


Thr 


Gly Lys 


Gly 


Glu 


Gly 


Val 


Ala 


Tyr Arg Gly 






265 










270 




Leu 


Ala Thr 


Phe 


Leu 


Glu 


Lys 


Thr 


Pro 


Pro Asp 




280 










285 






He 


Ser Asn Asp Asp 


Leu 


Leu 


Val 


Val 


Glu Lys 




295 








300 








Lys 


Tyr Ser 


Leu 


Ser 


Ala 


Val 


Phe 


His 


Ser Ala 


310 








315 








320 


Val 


Gly Glu Ala 


He 


Gin 


Phe 


Glu 


Val 


Ser He 








330 










335 


Lys 


Phe Asp 


Thr 


Thr 


Cys 


Lys 


Pro 


Leu 


Ala Ser 






345 










350 




Arg Ala Val 


Phe Asp 


Gly Asn 


Tyr 


Tyr 


Tyr Tyr 




360 










365 






Thr 


Lys Pro 


Val 


Val 


Thr 


Leu 


Thr 


Ser 


Tyr Trp 




375 








380 








Arg Leu Asp Ala Val 


Asn 


Thr 


Leu 


Leu 


Ala Met 


390 








395 








400 


Thr 


Asn He 


Glu 


Ala 


Leu 


Lys 


Ser 


Gly 


He Gin 








410 










415 


Asn 


Gin Leu 


Ala 


Glu 


Leu 


Trp 


Leu 


Lys 


Leu He 






425 










430 




Asp Thr Arg 


Tyr 


Thr 


Leu 


Pro 


Leu 


Thr Glu Gly 




440 










445 
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Lys 


Ala 


Asn 


vai 


inr 


vai 


Leu 


Asp 


Thr 


Gin 


He 


Arg 


Lys 


Leu 


Arg 


Ser 




4 DU 










455 










4 60 










Axg 


oer 


Leu 




bin 


lie 


rll S 


blU 


Ala 


TV 1 A 

Ala 


vai 


TV ~ 

Arg 


Met 


Arg 


Ser 


Glu 


4 OD 










4 70 










475 










480 


TV 1 -V 

Ala 


1 nr 


Asp 


vai 


Lys 


ber 


Tnr 


Leu 


TV 1 

Ala 


Glu 


T 1 A 

He 


Glu 


Asp 


Trp 


Leu 


Asp 










/I Q C 
4 O D 










4 90 










4 95 




Lys 


Leu 




oxn 


Leu 


1 nr 


blU 


T 1 1 
blU 


n „ 
rro 


bin 


Asn 


Ser 


Met 


Pro 


Asp 


He 








D UU 










Q n c 

505 










510 






lie 


Trp 


Meu 


lie 


Arg 


biy 


blU 


Lys 


Arg 


Leu 


TV T -V 

Ala 


Tyr 


Ala 


Arg 


He 






Ol3 










C O A 

520 










c o c 

525 








Pro 


Ala 


His 


bin 


Val 


Leu 


Tyr 


Ser 


rp l_ 

Tnr 


Ser 


Gly 


Glu 


Asn 


Ala 


Ser 


Gly 




c n 










C "3 C 

535 










540 










Lys 


Tyr 


Cys 


Gly 


Lys 


1 nr 


Gin 


Thr 


He 


Phe 


Leu 


Lys 


Tyr 


Pro 


Gin 


Glu 


545 










550 










555 










560 


Lys 


Asn 


Asn 


b±y 


rro 


Lys 


vai 


Pro 


vai 


blU 


lieu 


Arg 


Val 


Asn 


He 


Trp 










c ^ c 
565 










570 










575 




Leu 


Gly 


Leu 


C V- 

ber 


Ala 


vai 


blU 


Lys 


Lys 


Phe 


Asn 


Ser 


Phe 


Ala 


Glu 


Gly 








C. O f\ 










cot 
58 5 










590 






Thr 


Phe 


inr 


val 


rne 


Ala 


blU 


Met 


Tyr 


Glu 


Asn 


Gin 


Ala 


Leu 


Met 


Phe 






c n c 

595 










f r\ r\ 

600 










605 








Gly Lys 


Trp 


Gly 


W 

Tnr 


ser 


Gly 


Leu 


Val 


Gly 


Arg 


His 


Lys 


Phe 


Ser 


Asp 




610 










615 










620 










Val 


Thr 


tjiy 


bys 


lie 


Lys 


Leu 


Lys 


Arg 


Glu 


Pne 


Phe 


Leu 


Pro 


Pro 


Lys 


625 










530 










635 










640 


Gly Trp 


Glu 


Trp 


Glu 


biy 


Glu 


Trp 


He 


Val 


Asp 


Pro 


Glu 


Arg 


Ser 


Leu 










64 5 










650 










655 




Leu 


Thr 


Glu 


Ala 


Asp 


Ala 


biy 


His 


Thr 


Glu 


Phe 


Thr 


Asp 


Glu 


Val 


Tyr 








660 










665 










670 






Gin 


Asn 


Glu 


Ser 


Arg 


Tyr 


•Pro 


Gly 


Gly 


Asp 


Trp 


Lys 


Pro 


Ala 


Glu 


Asp 






675 










680 










685 








Thr 


Tyr 


Tnr 


Asp 


TV 1 -» 

Ala 


TV _ 

Asn 


Gly 


TV 

Asp 


Lys 


TV 1 -» 

Ala 


TV T « 

Ala 


Ser 


Pro 


Ser 


Glu 


Leu 




690 










695 










700 










Thr 


Cys 


Pro 


Pro 


biy 


Trp 


blU 


Trp 


Glu 


Asp 


Asp 


TV 1 -s 

Ala 


Trp 


Ser 


Tyr 


Asp 


705 










710 










715 










720 


He 


Asn 


Arg 


Ala 


Val 


Asp 


Glu 


Lys 


Gly 


Trp 


Glu 


Tyr 


Gly 


He 


Thr 


He 










725 










730 










735 




Pro 


Pro 


Asp 


HXS 


Lys 


Pro 


Lys 


ber 


Trp 


Val 


TV T ^ 

Ala 


TV 1 -% 

Ala 


Glu 


Lys 


Met 


Tyr 
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750 






His 


Thr 


nXS 


Arg 


Arg 


Arg 


Arg 


Leu 


val 


Arg 


Lys 


Arg 


Lys 


Lys 


TV AWN. 

Asp 


Leu 






"7 C C 










7 60 










765 








Thr 


Gin 


inr 


Ala 


Ser 


Ser 


rp l_ „ 

inr 


Ala 


Arg 


TV 1 — 

Ala 


Met 


Glu 


Glu 


Leu 


Gin 


Asp 




770 










T T C 
/ /5 










780 










Gin 


Glu 


r^ \7 
\3xy 


Trp 


bJ-U 


Tyr 


A J. a 


ber 


Leu 


T 1 « 

lie 


biy 


Trp 


Lys 


pne 


HlS 


Trp 


785 










/ yu 










"7 rt C 

795 










r\ r\ r\ 

800 


Lys 


Gin 


Arg 


oer 


Ser 


TV i^vo 

ASp 


inr 


Pne 


Arg 


Arg 


Arg 


Arg 


Trp 


Arg 


Arg 


Lys 










one 










810 










815 




Met 


Ala 


Pro 


C ^ y» 

oer 


blU 


1 nr 


n J.S 


biy 


Tk T 

Ala 


Ala 


TV 1 

Ala 


lie 


1-JK _ 

Pne 


Lys 


Leu 


blU 


















o o c 
o25 










o "O n 

o30 






Gly Ala 


Leu 


biy 


Ala 


Asp 


inr 


inr 


blU 


TV 

Asp 


Gly 


Asp 


Glu 


Lys 


Ser 


Leu 
















Q A r\ 
o4 U 










845 








Glu 


Lys 


bin 


Lys 


nlS 


Ser 


Ala 


mr 


rp _ 

Tnr 


Val 


Pne 


Gly 


TV 1 « 

Ala 


Asn 


Thr 


Pro 




850 










o c c 










860 










He 


Val 






2i crt 
noil 




TV er\ 


riJT g 


Vol 


Tyr 


lie 


Tyr 


nlS 


Leu 


Arg 


Cys 


865 










870 










875 










880 


Tyr 


Val 


Tyr 


Gin 


Ala 


Arg 


Asn 


Leu 


Leu 


Ala 


Leu 


Asp 


Lys 


Asp 


Ser 


Phe 










885 










890 










895 




Ser Asp 


Pro 


Tyr 


Ala 


His 


He 


Cys 


Phe 


Leu 


His 


Arg 


Ser 


Lys 


Thr 


Thr 








900 










905 










910 






Glu 


He 


He 


His 


Ser 


Thr 


Leu 


Asn 


Pro 


Thr 


Trp 


Asp 


Gin 


Thr 


He 


He 
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915 920 925 

Phe Asp Glu Val Glu He Tyr Gly Glu Pro Gin Thr Val Leu Gin Asn 

930 935 940 

Pro Pro Lys Val He Met Glu Leu Phe Asp Asn Asp Gin Val Gly Lys 
945 950 955 960 

Asp Glu Phe Leu Gly Arg Ser He Phe Ser Pro Val Val Lys Leu Asn 

965 970 975 

Ser Glu Met Asp lie Thr Pro Lys Leu Leu Trp His Pro Val Met Asn 

980 985 990 

Gly Asp Lys Ala Cys Gly Asp Val Leu Val Thr Ala Glu Leu He Leu 

995 1000 1005 

Arg Gly Lys Asp Gly Ser Asn Leu Pro He Leu Pro Pro Gin Arg Ala 

1010 1015 1020 

Pro Asn Leu Tyr Met Val Pro Gin Gly He Arg Pro Val Val Gin Leu 
1025 1030 1035 1040 

Thr Ala He Glu He Leu Ala Trp Gly Leu Arg Asn Met Lys Asn Phe 

1045 1050 1055 

Gin Met Ala Ser He Thr Ser Pro Ser Leu Val Val Glu Cys Gly Gly 

1060 1065 1070 

Glu Arg Val Glu Ser Val Val He Lys Asn Leu Lys Lys Thr Pro Asn 

1075 1080 1085 

Phe Pro Ser Ser Val Leu Phe Met Lys Val Phe Leu Pro Lys Glu Glu 

1090 1095 1100 

Leu Tyr Met Pro Pro Leu Val He Lys Val He Asp His Arg Gin Phe 
1105 1110 1115 1120 

Gly Arg Lys Pro Val Val Gly Gin Cys Thr He Glu Arg Leu Asp Arg 

1125 1130 1135 

Phe Arg Cys Asp Pro Tyr Ala Gly Lys Glu Asp He Val Pro Gin Leu 

1140 1145 1150 

Lys Ala Ser Leu Leu Ser Ala Pro Pro Cys Arg Asp He Val He Glu 

1155 1160 1165 

Met Glu Asp Thr Lys Pro Leu Leu Ala Ser Lys Leu Thr Glu Lys Glu 

1170 1175 1180 

Glu Glu He Val Asp Trp Trp Ser Lys Phe Tyr Ala Ser Ser Gly Glu 
1185 1190 1195 1200 

His Glu Lys Cys Gly Gin Tyr He Gin Lys Gly Tyr Ser Lys Leu Lys 

1205 1210 1215 ' 

He Tyr Asn Cys Glu Leu Glu Asn Val Ala Glu Phe Glu Gly Leu Thr 

1220 1225 1230 

Asp Phe Ser Asp Thr Phe Lys Leu Tyr Arg Gly Lys Ser Asp Glu Asn 

1235 1240 1245 

Glu Asp Pro Ser Val Val Gly Glu Phe Lys Gly Ser Phe Arg He Tyr 

1250 1255 1260 

Pro Leu Pro Asp Asp Pro Ser Val Pro Ala Pro Pro Arg .Gin Phe Arg 
1265 1270 1275 1280 

Glu Leu Pro Asp Ser Val Pro Gin Glu Cys Thr Val Arg He Tyr He 

1285 1290 1295 

Val Arg Gly Leu Glu Leu Gin Pro Gin Asp Asn Asn Gly Leu Cys Asp 

1300 1305 1310 

Pro Tyr He Lys He Thr Leu Gly Lys Lys Val He Glu Asp Arg Asp 

1315 1320 1325 

His Tyr He Pro Asn Thr Leu Asn Pro Val Phe Gly Arg Met Tyr Glu 

1330 1335 1340 

Leu Ser Cys Tyr Leu Pro Gin Glu Lys Asp Leu Lys He Ser Val Tyr 
1345 1350 1355 1360 

Asp Tyr Asp Thr Phe Thr Arg Asp Glu Lys Val Gly Glu Thr He He 

1365 1370 1375 

Asp Leu Glu Asn Arg Phe Leu Ser Arg Phe Gly Ser His Cys Gly He 
1380 1385 1390 
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Pro Glu Glu Tyr Cys Val Ser Gly Val Asn Thr Trp Arg Asp Gin Leu 

1395 1400 1405 

Arg Pro Thr Gin Leu Leu Gin Asn Val Ala Arg Phe Lys Gly Phe Pro 

1410 1415 1420 

Gin Pro lie Leu Ser Glu Asp Gly Ser Arg He Arg Tyr Gly Gly Arg 
1425 1430 1435 1440 

Asp Tyr Ser Leu Asp Glu Phe Glu Ala Asn Lys He Leu His Gin His 

1445 1450 1455 

Leu Gly Ala Pro Glu Glu Arg Leu Ala Leu His lie Leu Arg Thr Gin 

1460 1465 1470 

Gly Leu Val Pro Glu His Val Glu Thr Arg Thr Leu His Ser Thr Phe 

1475 1480 1485 

Gin Pro Asn He Ser Gin Gly Lys Leu Gin Met Trp Val Asp Val Phe 

1490 1495 1500 

Pro Lys Ser Leu Gly Pro Pro Gly Pro Pro Phe Asn He Thr Pro Arg 
1505 1510 1515 1520 

Lys Ala Lys Lys Tyr Tyr Leu Arg Val He He Trp Asn Thr Lys Asp 

1525 1530 1535 

Val He Leu Asp Glu Lys Ser He Thr Gly Glu Glu Met Ser Asp He 

1540 1545 1550 

Tyr Val Lys Gly Trp He Pro Gly Asn Glu Glu Asn Lys Gin Lys Thr 

1555 1560 1565 

Asp Val His Tyr Arg Ser Leu Asp Gly Glu Gly Asn Phe Asn Trp Arg 

1570 1575 1580 

Phe Val Phe Pro Phe Asp Tyr Leu Pro Ala Glu Gin Leu Cys He Val 
1585 1590 1595 1600 

Ala Lys Lys Glu His Phe Trp Ser He Asp Gin Thr Glu Phe Arg He 

1605 1610 1615 

Pro Pro Arg Leu He He Gin He Trp Asp Asn Asp Lys Phe Ser Leu 

1620 1625 1630 

Asp Asp Tyr Leu Gly Phe Leu Glu Leu Asp Leu Arg His Thr He He 

1635 1640 1645 

Pro Ala Lys Ser Pro Glu Lys Cys Arg Leu Asp Met He Pro Asp Leu 

1650 1655 1660 

Lys Ala Met Asn Pro Leu Lys Ala Lys Thr Ala Ser Leu Phe Glu Gin 
1665 1670 1675 1680 

Lys Ser Met Lys Gly Trp Trp Pro Cys Tyr Ala Glu Lys Asp Gly Ala 

1685 1690 1695 

Arg Val Met Ala Gly Lys Val Glu Met Thr Leu Glu He Leu Asn Glu 

1700 1705 1710 

Lys Glu Ala Asp Glu Arg Pro Ala Gly Lys Gly Arg Asp Glu Pro Asn 

1715 1720 1725 

Met Asn Pro Lys Leu Asp Leu Pro Asn Arg Pro Glu Thr Ser Phe Leu 

1730 1735 1740 

Trp Phe Thr Asn Pro Cys Lys Thr Met Lys Phe He Val Trp Arg Arg 
1745 1750 1755 1760 

Phe Lys Trp Val He He Gly Leu Leu Phe Leu Leu He Leu Leu Leu 

1765 1770 1775 

Phe Val Ala Val Leu Leu Tyr Ser Leu Pro Asn Tyr Leu Ser Met Lys 

1780 1785 1790 

He Val Lys Pro Asn Val 
1795 



<210> 97 

<211> 3724 

<212> DNA 

<213> Homo sapiens 
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<400> 97 

gaattccggc ccggcatccc gatggccgcc gctgggcccc ggcgctccgt gcgcggagcc 60 
gtctgcctgc atctgctgct gaccctcgtg atcttcagtc gtgatggtga agcctgcaaa 120 
aaggtgatac ttaatgtacc ttctaaacta gaggcagaca aaataattgg cagagttaat 180 
ttggaagagt gcttcaggtc tgcagacctc atccggtcaa gtgatcctga tttcagagtt 240 
ctaaatgatg ggtcagtgta cacagccagg gctgttgcgc tgtctgataa gaaaagatca 300 
tttaccatat ggctttctga caaaaggaaa cagacacaga aagaggttac tgtgctgcta 360 
gaacatcaga agaaggtatc gaagacaaga cacactagag aaactgttct caggcgtgcc 420 
aagaggagat gggcacctat tccttgctct atgcaagaga attccttggg ccctttccca 480 
ttgtttcttc aacaagttga at'ctgatgca gcacagaact atactgtctt ctactcaata 540 > 
agtggacgtg gagttgataa agaaccttta aatttgtttt atatagaaag agacactgga 600 
aatctatttt gcactcggcc tgtggatcgt gaagaatatg atgtttttga tttgattgct 660 
tatgcgtcaa ctgcagatgg atattcagca gatctgcccc tcccactacc catcagggta 720 
gaggatgaaa atgacaacca ccctgttttc acagaagcaa tttataattt tgaagttttg 780 
gaaagtagta gacctggtac tacagtgggg gtggtttgtg ccacagacag agatgaaccg 840 
gacacaatgc atacgcgcct gaaatacagc attttgcagc agacaccaag gtcacctggg 900 
ctcttttctg tgcatcccag cacaggcgta atcaccacag tctctcatta tttggacaga 960 
gaggttgtag acaagtactc attgataatg aaagtacaag acatggatgg ccagtttttt 1020 
ggattgatag gcacatcaac ttgtatcata acagtaacag attcaaatga taatgcaccc 1080 
actttcagac aaaatgctta tgaagcattt gtagaggaaa atgcattcaa tgtggaaatc 1140 
ttacgaatac ctatagaaga taaggattta attaacactg ccaattggag agtcaatttt 1200 
accattttaa agggaaatga aaatggacat ttcaaaatca gcacagacaa agaaactaat 1260 
gaaggtgttc tttctgttgt aaagccactg aattatgaag aaaaccgtca agtgaacctg 1320 
gaaattggag taaacaatga agcgccattt gctagagata ttcccagagt gacagccttg 1380 
aacagagcct tggttacagt tcatgtgagg gatctggatg aggggcctga atgcactcct 14 40 
gcagcccaat atgtgcggat taaagaaaac ttagcagtgg ggtcaaagat caacggctat 1500 
aaggcatatg accccgaaaa tagaaatggc aatggtttaa ggtacaaaaa attgcatgat 1560 
cctaaaggtt ggatcacca.t tgatgaaatt tcagggtcaa tcataacttc caaaatcctg 1620 
gatagggagg ttgaaactcc caaaaatgag ttgtataata ttacagtcct ggcaatagac 1680 
aaagatgata gatcatgtac tggaacactt gctgtgaaca ttgaagatgt aaatgataat 1740 
ccaccagaaa tacttcaaga atatgtagtc atttgcaaac caaaaatggg gtataccgac 1800 
attttagctg ttgatcctga tgaacctgtc catggagctc cattttattt cagtttgccc 1860 
aatacttctc cagaaatcag tagactgtgg agcctcacca aagttaatga tacagctgcc 1920 
cgtctttcat atcagaaaaa tgctggattt caagaatata ccattcctat tactgtaaaa 1980 
gacagggccg gccaagctgc aacaaaatta ttgagagtta atctgtgtga atgtactcat 204 0 
ccaactcagt gtcgtgcgac ttcaaggagt acaggagtaa tacttggaaa atgggcaatc 2100 
cttgcaatat tactgggtat agcactgctc ttttctgtat tgctaacttt agtatgtgga 2160 
gtttttggtg caactaaagg gaaacgtttt cctgaagatt tagcacagca aaacttaatt 2220 
atatcaaaca cagaagcacc tggagacgat agagtgtgct ctgccaatgg atttatgacc 2280 
caaactacca acaactctag ccaaggtttt tgtggtacta tgggatcagg aatgaaaaat 2340 
ggagggcagg aaaccattga aatgatgaaa ggaggaaacc agaccttgga atcctgccgg 2400 
ggggctgggc atcatcatac cctggactcc tgcaggggag gacacacgga ggtggacaac 24 60 
tgcagataca cttactcgga gtggcacagt tttactcagc cccgtctcgg tgaaaaattg 2520 
catcgatgta atcagaatga agaccgcatg ccatcccaag attatgtcct cacttataac 2580 
tatgagggaa gaggatctcc agctggttct gtgggctgct gcagtgaaaa gcaggaagaa 2640 
gatggccttg actttttaaa taatttggaa cccaaattta ttacattagc agaagcatgc 2700 
agtgctacaa ttaggtcttt gtcagacatt ctggaggttt ccaaaaataa tattgtaaag 2760 
ttcaatttca acatgtatgt atatgatgat ttttttctca attttgaatt atgctactca 2820 
ccaattatat ttttaaagca agttgttgct tatcttttcc aaaaagtgaa aaatgttaaa 2880 
acagacaact ggtaaatctc aaactccagc actggaatta aggtctctaa agcatctgct 2940 
cttttttttt ttacggatat tttagtaata aatatgctgg ataaatatta gtccaacaat 3000 
agctaagtta tgctaatatc acattattat gtattcactt taagtgatag tttaaaaaat 3060 
aaacaagaaa tattgagtat cactatgtga agaaagtttt ggaaaagaaa caatgaagac 3120 
tgaattaaat taaaaatgtt gcagctcata aagaattggg actcacccct actgcactac 3180 
caaattcatt tgactttgga ggcaaaatgt gttgaagtgc cctatgaagt agcaattttc 3240 
tataggaata tagttggaaa taaatgtgtg tgtgtatatt attattaatc aatgcaatat 3300 
ttaaaatgaa atgagaacaa agaggaagat ggtaaaaact tgaaatgagg ctggggtata 3360 
gtttgttcta caatgaaaaa agagagagct ttctaggcct gggctcttaa atgctgcatt 34 20 
ataactgagt ctatgaggaa ataagtcctg ttcaaattgt gtaatttgtt taaaatgtaa 34 80 
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ataaataaac ttttctggtt tctgtgggaa ggaaataggg aatccaatgg aacagtagct 3540 

ttgctttgca gtctgtttca agatttctgc atccacaagt tagtagcaaa ctggggaata 3600 

ctcgctgcag ctggggttcc ctgctttttg gtagcaaggg tccagagatg agggtgtttt 3660 

tttcggggag ctaataacaa aaacatttta aaacttacct ttactgaagt taaatcctta 3720 

ttgc 3724 

<210> 98 
<211> 896 
<212> PRT 

<213> Homo sapiens 
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Asp 
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O A W 
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Asp 
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Arg 


val 
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7\ 
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Asp 
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Ser 


Val 


Tyr 
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Arg 


Ala 


Val 
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Ser 
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Lys 


Lys 


Arg 


Ser 


Phe 


Thr 


He 
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Lys 
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Val 
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Glu 


Thr 
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Arg 
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He 


Pro 
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Ser 


Met 
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1 4 A 
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Gly 
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Pro 
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Glu 
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He 
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Val 
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Ser 
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Val 
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Val 


Val 
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Arg Asp 
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He 
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Pro 


Gly 


Leu 


Phe 


Ser 


Val 


His 


Pro 


Ser 


Thr 


Gly Val 


He 
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Val 


Val 


Asp 
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He 


Met 
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Val 
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Met 
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Gly 
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Phe 


Phe 


Gly Leu 


He 
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Thr 
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He 


Thr 


Val 
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Ser 
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Glu 
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J / U 








J ID 
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7\*<« I'V^^ 

Hsn inr Hia 


Asn 


Trp Arg 


vax 


Asn 


Phe Thr He 


Leu Lys 


Gly Asn Glu 


ODD 






"2 r\ r\ 
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400 
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Lys 


xxe 


Ser 


Thr Asp Lys Glu 


Thr Asn 


Glu Gly Val 
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4 10 
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Lieu Vox 


Val 


Lys 


Pro 


Leu 


Asn 


Tyr Glu Glu 


Asn Arg Gin 


Val Asn 




420 
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Taii TIa 

ijcu uxu xjie 


Gly Val 


Asn 


Asn 


Glu 


Ala Pro Phe 


Ala Arg Asp 


He Pro 


4 JO 
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r\L\^ vox inx 


Ala 


Leu 


Asn 


Arg 


Ala 


Leu Val Thr 


Val His 


Val 


Arg Asp 


fi DU 
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460 




JjeU rVSp oXU 


Gly 


Pro 


Glu 


Cys 


Thr 


Pro Ala Ala 


Gin Tyr 


Val 


Arg He 
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Val 


Gly 
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Lys He Asn 


Gly Tyr 


Lys 


Ala Tyr 






485 








4 90 






495 


Msp trlTO uXU 


Asn Arg Asn Gly 


Asn Gly Leu Arg 


Tyr Lys 


Lys 


Leu His 




500 
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510 




ASp lrJ.O liyS 


Gly Trp 


He 


Thr 


lie 


Asp Glu He 


Ser Gly 


Ser 


He He 


oxo 










520 




525 






1 fix OCX ijys 


lie 


Leu 


Asp Arg 


Glu 


Val Glu Thr 


Pro Lys 


Asn 


Glu Leu 


c; n 








535 






540 






1 yr Asn iie 


Thr 


Val 


Leu 


Ala 


He 


Asp Lys Asp Asp Arg 


Ser 


Cys Thr 








550 






555 






560 


y^T., rpLi.^ T^ii 

oxy inr iieu 


AX a 


Val 


Asn 


He 


Glu 


Asp Val Asn Asp Asn 


Pro 


Pro Glu 






565 








570 






575 


Tl » T All »<s 

xxe Lieu uxn 


Glu 


Tyr 


Val 


Val 


He 


Cys Lys Pro 


Lys Met 


Gly Tyr Thr 




580 










585 




590 




MSp XXc XiGU 


Ala 


Val 


Asp 


Pro 


Asp Glu Pro Val 


His Gly 


Ala 


Pro Phe 












600 




605 






'Ptfv DV^A 

lyr rne oer 


Leu 


Pro 


Asn 


Thr 


Ser 


Pro Glu He 


Ser Arg 


Leu 


Trp Ser 


DX u 








615 






620 




T«?»iT T^V"!-*- Ttri^ 

Lieu inr -uys 


Val 


Asn 


Asp 


Thr 


Ala 


Ala Arg Leu 


Ser Tyr 


Gin 


Lys Asn 








630 






635 






640 


Axa ijXy rne 


Gin 


Glu 


Tyr 


Thr 


He 


Pro He Thr 


Val Lys Asp Arg Ala 






645 








650 






655 


/^li/ Pin ala 

Vaxy vjxn AXo 


Ala 


Thr 


Lys 


Leu 


Leu Arg Val Asn 


Leu Cys 


Glu 


Cys Thr 




660 










665 




670 


nXS ITXO iXix 


Gin 


Cys 


Arg Ala 


Thr 


Ser Arg Ser 


Thr Gly 


Val 


He Leu 


D / O 










680 




685 






vsxy Liyb 1 xp 


Ala 


He 


Leu 


Ala 


He 


Leu Leu Gly 


He Ala 


Leu 


Leu Phe 


D y u 








695 






700 






OCX VclX Xic^U 


Leu 


Thr 


Leu 


Val 


Cys 


Gly Val Phe 


Gly Ala 


Thr 


Lys Gly 








710 






715 






720 


Liyo rixy iriie 


Pro 


Glu 


Asp 


Leu 


Ala 


Gin Gin Asn 


Leu He 


He 


Ser Asn 






725 








730 






735 


Thr m n AT 
xiix riXd 


Pro 


Gly Asp Asp 


Arg 


Val Cys Ser Ala Asn Gly Phe Met 




740 










745 




750 




llIX UXil XIIX 


Thr 


Asn 


Asn 


Ser 


Ser 


Gin Gly Phe Cys Gly Thr Met Gly 












760 




765 






OCX oxy net. 


Lys Asn Gly Gly 


Gin 


Glu Thr He 


Glu Met 


Met 


Lys Gly 


/ / u 








775 






780 




f^l\7 Zicn fZln 
wxy rioii 


Thr 


Leu 


Glu 


Ser 


Cys 


Arg Gly Ala 


Gly His 


His 


His Thr 


7ft 






790 






795 






800 


Leu Asp Ser 


Cys Arg Gly Gly 


His 


Thr Glu Val 


Asp Asn Cys Arg Tyr 






805 








810 






815 


Thr Tyr Ser 


Glu 


Trp 


His 


Ser 


Phe 


Thr Gin Pro 


Arg Leu 


Gly Glu Lys 




820 










825 




830 




Leu His Arg 


Cys 


Asn 


Gin 


Asn 


Glu 


Asp Arg Met 


Pro Ser 


Gin 


Asp Tyr 


835 










840 




845 
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Val Leu Thr Tyr Asn Tyr Glu Gly Arg Gly Ser Pro Ala Gly Ser Val 

850 855 860 

Gly Cys Cys Ser Glu Lys Gin Glu Glu Asp Gly Leu Asp Phe Leu Asn 
865 870 875 880 

Asn Leu Glu Pro Lys Phe lie Thr Leu Ala Glu Ala Cys Thr Lys Arg 
885 890 895 



<210> 99 

<211> 2391 

<212> DNA 

<213> Homo sapiens 

<400> 99 

gtcgacccac gcgtccgccg ctacgaggcc ctgcgtgggg agcagccccc ggaccttgag 60 
acaacagtca ttctgcctga gtctgtcttc agagagacgc cccccgtggt caggcccgca 120 
ggccccggag aggcccagga gccagaggag ctggcacggc gacagcgacg gcacccggag 180 
ctgagccagg gtgaggctgt ggccagcgtc atcatctacc gcaccctggc cgggctaetg 240 
cctcataact atgaccctga caagcgcagc ttgagagtcc ccaaacgccc gatcatcaac 300 
acacccgtgg tgagcatcag cgtccatgat gatgaggagc ttctgccccg ggccctggac 360 
aaacccgtca cggtgcagtt ccgcctgctg gagacagagg agcggaccaa gcccatctgt 420 
gtcttctgga accattcaat cctggtcagt ggcacaggtg gctggtcggc cagaggctgt 480 
gaagtcgtct tccgcaatga gagccacgtc agctgccagt gcaaccacat gacgagcttc 540 
gctgtgctca tggacgtttc tcggcgggag aatggggaga tcctgccact gaagacactg 600 
acatacgtgg ctctaggtgt caccttggct gcccttctgc tcaccttctt cttcctcact 660 
ctcttgcgta tcctgcgctc caaccaacac ggcatccgac gtaacctgac agctgccctg 720 
ggcctggctc agctggtctt cctcctggga atcaaccagg ctgacctccc ttttgcctgc 780 
acagtcattg ccatcctgct gcacttcctg tacctctgca ccttttcctg ggctctgctg 840 
gaggccttgc acctgtaccg ggcactcact gaggtgcgcg atgtcaacac cggccccatg 900 
cgcttctact acatgctggg ctggggcgtg cctgccttca tcacagggct agccgtgggc 960 
ctggaccccg agggctacgg gaaccctgac ttctgctggc tctccatcta tgacacgctc 1020 
atctggagtt ttgctggccc ggtggccttt gccgtctcga tgagtgtctt cctgtacatc 1080 
ctggcggccc gggcctcctg tgctgcccag cggcagggct ttgagaagaa aggtcctgtc 1140 
tcgggcctgc agccctcctt cgccgtcctc ctgctgctga gcgccacgtg gctgctggca 1200 
ctgctctctg tcaacagcga caccctcctc ttccactacc tctttgctac ctgcaattgc 1260 
atccagggcc ccttcatctt cctctcctat gtggtgctta gcaaggaggt ccggaaagca 1320 
ctcaagcttg cctgcagccg caagcccagc cctgaccctg ctctgaccac caagtccacc 1380 
ctgacctcgt cctacaactg ccccagcccc tacgcagatg ggcggctgta ccagccctac 1440 
ggagactcgg ccggctctct gcacagcacc agtcgctcgg gcaagagtca gcccagctac 1500 
atccccttct tgctgaggga ggagtccgca ctgaaccctg gccaagggcc ccctggcctg 1560 
ggggatccag gcagcctgtt cctggaaggt caagaccagc agcatgatcc tgacacggac 1620 
tccgacagtg acctgtcctt agaagacgac cagagtggct cctatgcctc tacccactca 1680 
tcagacagtg aggaggaaga agaggaggag gaagaggagg ccgccttccc tggagagcag 174 0 
ggctgggata gcctgctggg gcctggagca gagagactgc ccctgcacag tactcccaag 1800 
gatgggggcc cagggcctgg caaggccccc tggccaggag actttgggac cacagcaaaa 1860 
gagagtagtg gcaacggggc ccctgaggag cggctgcggg agaatggaga tgccctgtct 1920 
cgagaggggt ccctaggccc ccttccaggc tcttctgccc agcctcacaa aggcatcctt 1980 
aagaagaagt gtctgcccac catcagcgag aagagcagcc tcctgcggct ccccctggag 2040 
caatgcacag ggtcttcccg gggctcctcc gctagtgagg gcagccgggg cggcccccct 2100 
ccccgcccac cgccccggca gagcctccag gagcagctga acggggtcat gcccatcgcc 2160 
atgagcatca aggcaggcac ggtggatgag gactcgtcag gctccgacag cgacgaaacg 2220 
tccatctgag gagcctgggc cttgccggga ggggtactca ccccacctaa ggccatctag 2280 
tgccaactcc ccccccacca ttcccctcac tgcactttgg acccctgggg ccaacatctc 2340 
caagacaaag tttttcagaa aagaggaaaa aaaaaaaaaa agggcggccg c 2391 

<210> 100 
<211> 565 
<212> PRT 

<213> Homo sapiens 
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<400> 100 



Met 


Thr 






Al a 


Vox 




Mel: 


A<o 


vox 


^py 

OCX 


Ayn 
AX ^ 


A y 
AX ^ 


<^l n 

\3X U 


Asil 


oxy 


1 


















X u 










X ^ 




Glu 


He 








Xiy o 


Thr 

X IIX 


Leu 


Thr 


Tvy 
lyt 


Val 

vox 


Ala 


T.P11 


ox y 


Va 1 
vox 


Thy 
1 tlx 








20 










25 










O V 






Leu 


Ala 


AT a 


T.on 






4i 1 IX 


Phe 


Phe 


Phe 


Leu 


Thy 

X XlX 


T .Pn 

XiC u 


T.pn 

XjCU 


A y rr 

AX ^ 


Tl P 
X xc 






o -J 










40 










4 S 








Leu Arg 


i. 




m n 

UIX 1 1 


His 


Gly 


lie 


Arg 


Arg 


Asn 


Leu 


Thr 


Ala 


Ala 


x*c u* 




50 




















fin 










Gly Leu 


Ala 


m n 

V3X11 


XldX 


Val 


Phe 


Leu 


Leu 


lie 


Asn 


Gin 


Ala 


AO^ 


Leu 


pyn 


65 




















75 










PO 


Phe 


Ala 


Cys 


Thr 


Val 


He 


Ala 


lie 


Leu 


Leu 


His 


Phe 


Leu 


Tvr 

1 yx 


Leu 


Cys 










ft'=^ 
























Thr 


Phe 


Ocl. 


irp 


Al A 


jjeu 


Leu 


Glu 


Al a 
nx a 


T.P11 

XlC Ll 


H-i e; 
nx o 


XlCU 


1 yr 


A y o 
AX y 


Ala 

AXcl 


XlC u 








1 nn 

i U u 










1 ns 

X U «7 










110 
X X u 






Thr 


Glu 


V cl-L 


Arg 


Asp 


vox 


A 


Thy 

JL I IX 


ni V 
ox y 


Pyn 

JTX 




A r-rr 
Ax y 


Php 

tile 


iyr 


1 yr 


















^ 90 

X ^ u 










1 9 R 








Leu Gly 




o J.y 


V cix 


P yrt 

t X *J 


Ala 


Phe 


Tip 
X xc 


Thy 

1 liX 


Gly 


T .PI 1 


Al a 
AX a 


Va 1 
Vox 


(^1 w 


T .PT 1 
XjC Li 














X J .J 










1 40 










Asp 


Pro 




ux y 


iyr 


1 \/ 
\j±y 


Asn 


XT LKJ 


- 

Asp 


Pho 
tr lit; 


^ys 


irp 


Leu 


OCX 


Tip 
X X c 


iyr 












X o u 










X^ J 










X DU 


Asp 


inr 


Leu 


T 1 o 


1 X p 


oex 


PHo 


Ala 


Vjxy 


rx O 


Va 1 
V dX 


Ala 
AXo 


rrnc 


al a 

AXo 


\/a 1 

Vox 


ocx 




















170 
X / U 










1 7 




lYieT- 


Ser 




IT lie 


Leu 


lyr 


Tip 


Lieu 


Ala 
riXO 


al a 

riX a 


Arg 


Ala 
AXo 


OCX 


\»-.y a 


Ala 
AX a 


Al a 
aXo 








1 ft n 
± o u 










X 0 3 


















Arg 


O J.1 1. 


J. y 


IT lit; 


ox Ll 


j-iy s 


ijy s 


vjx y 


r i. U 


Va 1 
V d X 


OCX 


o X y 


Leu 


U X i 1 


Pt-o 
t X 
















z u u 










9 n c; 
^ U 0 








Ser 


Phe 




V o.L 


Leu 


Leu 


Leu 


Leu 


OCX 


riXa 


1 fix 


irp 


Leu 


Leu 


zn a 


Leu 




210 










^ X D 










990 










Leu 


Ser 


V a. J. 


A Qn 


h7C?X 


Asp 


Thr 

1 iIX 


T.PI1 


T.Pi 1 


Php 


nx s 


Twy 

iyr 


T.Pi 1 

XiCU 


Php 


Ala 

AX Cl 


Thy 

i. ilX 


225 










Z J u 




















9 4 n 

Z 'I u 


Cys 


Asn 


jr o 




Gin 


vjxy 


Pyrt 
irx u 


Php 


Tip 
X xc 


Php 


T.P11 

XlCU 


Qp y 

OCX 


T\/y 

1 yr 


Val 
vox 


Va 1 
vox 


T.P1 1 
XiCu 










^ M ^ 




















Z Do 




Ser 


Lys 




vox 


rlx ^ 


T .\/c 

ijyo 


Al a 

nXO 




T \rc 
Xiy2> 


T Pii 


Ala 
AXa 




Q ck >* 

OCX 


Arg 


Lys 


Pro 








9 fin 

^ vJ VJ 










9 










97 0 






Ser 


Pro 




t X vj 


raX d 


i^eu 


Thr 

1 IIX 


Thy 

1 1 IX 


Xiy o 


Otr X 


Th y 

1 IIX 


■Lieu 


Th y 
1 i ix 


*^py 

OCX 


OCX 


T\7y 

iyr 






97 S 










9Rn 










9fl ^ 








Asn 


Cys 




iJC X 


PTn 

JTX 


T v/y 


Al ^4 
nXO 


A 


f^l V 

vjxy 


Ayrr 
nx ^ 


T .Pti 
XiC u 


T\/y 

iyr 


CI n 


Pyo 

IT X 


T\f y 

^ y^ 


ox y 




290 




















JKJKJ 










Asp 


Ser 


Ala 


vpx y 


^P T 




nxo 


O C?X 


Thy 

i. IIX 


Qpy 

OCX 


A yri 
rix ^ 


Qp y 
OCX 


n 1 \7 

oxy 


xiy o 


<?py 

OCX 


1 n 

O X 11 


305 










X V 










"^1 R 
J X «J 










'^90 

O^ w 


Pro 


Ser 


lyr 


T 1 fa 
X xc 


It X vj 


Php 






A YTt 
AX ^ 


ox u 


ux u 


c;py 

OCX 


Ala 
aXo 


Lieu 


A cr^ 


It X V./ 






























^ J ■J 




Gly 


Gin 


Glv 


Pro 


Pro 


Gly 


Leu 


Glv 

vjxy 


Asp 


Pro 


Gly 


Ser 


Leu 


Phe 


Leu 


Glu 








340 










J I J 
















Gly 


Gin 


Asp 


Gin 


Gin 


His 


Asp 


Pro 


Asp 


Thr 




spy 

OCX 


A ^y» 

AO^ 


Ser 


A 


Leu 






355 










360 










^ D J 








Ser 


Leu 


Glu 


Asp 


Asp 


Gin 


Ser 


Glv 
oxy 


Ser 


Tvy 
iyx 


Ala 


Ser 


Thr 


His 


Ser 


Ser 




370 




















"^ftO 
oou 










Asp 


Ser 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu. 


Glu 


Ala 


Ala 


Phe 


Pro 


385 










390 










395 










400 


Gly Glu 


Gin 


Gly 


Trp 


Asp 


Ser 


Leu 


Leu 


Gly 


Pro 


Gly 


Ala 


Glu 


Arg 


Leu 










405 










410 










415 




Pro 


Leu 


His 


Ser 


Thr 


Pro 


Lys 


Asp 


Gly 


Gly 


Pro 


Gly 


Pro 


Gly 


Lys 


Ala 








420 










425 










430 






Pro 


Trp 


Pro 


Gly 


Asp 


Phe 


Gly 


Thr 


Thr 


Ala 


Lys 


Glu 


Ser 


Ser 


Gly 


Asn 






435 










440 










445 








Gly Ala 


Pro 


Glu 


Glu 


Arg 


Leu 


Arg 


Glu 


Asn 


Gly 


Asp 


Ala 


Leu 


Ser 


Arg 
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450 














Glu 


Glv 


Ser 


Leu 


Gly 


Pro 


Leu 


Pro 


4 65 










470 






Gly 


He 


Leu 


Lys 


Lys 
485 


Lys 


Cys 


Leu 


Leu 


Leu 


Arg 


Leu 
500 


Pro 


Leu 


Glu 


Gin 


Ser 


Ala 


Ser Glu Gly Ser 


Arg 


Glv 






515 










520 


Arg 


Gin 
530 


Ser 


Leu 


Gin 


Glu 


Gin 

535 


Leu 


Ser 


He 


Lys 


Ala 


Gly 


Thr 


Val 


Asp 


545 










550 






Asp 


Glu 


Thr 


Ser 


He 









565 



4 60 



oi y 


Cat" 


OCX, 


Ala 


Gin 


Pro 


His 


Lys 






4 / O 










480 


Pro 


X nr 


T 1 o 


Ser 


Glu 


Lys 


Ser 


Ser 














4 95 




v^ys 




ijiy 


Ser 


Ser 


Arg 


Gly Ser 


505 










510 






Gly 


Pro 


Pro 


Pro Arg 


Pro 


Pro 


Pro 










525 








Asn 


Gly 


Val 


Met 


Pro 


He 


Ala 


Met 








540 










Glu 


Asp 


Ser 


Ser 


Gly 


Ser 


Asp 


Ser 






555 










560 



<210> 101 
<211> 3748 
<212> DNA 

<213> Homo sapiens 
<400> 101 

gggaacccag gcatgaatga ccatgcccct cacattctgt accctacctc aaccaactcg 60 
tcagcagcct tcgagatggt gcctcgaact gcccctgctg gctacctggt caccaaagtc 120 
atagctatgg actcagactc tgggcaaaat gcttggcttt tttaccatct agcccagact 180 
tctgacctgg acctctttaa ggtagagctg cacacaggag aaattaggac taccaggaag 240 
atgggagatg agagtggtag cactttcaac ctgaccgtgg tggtccgaga taatggagag 300 
ccatcactat cagcctctgt ggccattaca gtagctgtgg tggatagggt ttccaaaatc 360 
ctccctgaca ctcagaggca tgttaagagc cctcggacat actctgaaat taccctttat 420 
ctaataatag cattaagcac agtgtctttt atatttcttt tgacaatcat cattttgagc 480 
atcatcaagt gctaccgcta cactgcgtat ggcactgcat gctgtggagg cttctgtgga 540 
gtaagggaaa ggtcccctgc agaactgtac aaacaagcca acaacaatat tgatgccagg 600 
ataccgcatg gcctcaaagt gcagcctcac ttcattgaag ttcgagggaa tggctccctc 660 
accaagacct actgctacaa ggcctgtctg acagcaggct cagggagtga cactttcatg 720 
ttttacaata caggggccca gacaggacca gggccttcgg gagcccaagc agcagtgact 780 
gacagcagga atctcacagg ccaaagtggt cagaatgctg ggaacctgat tattctcaaa 840 
aatgaggctg tttctcaaaa tgagccacga cagcccaacc ctgactggcg ttactctgcc 900 
tccctgagag caggcatgca cagctctgtg cacctagagg aggctggcat tctacgggct 960 
ggtccaggag ggcctgatca gcagtggcca acagtatcca gtgcaacacc agaaccagag 1020 
gcaggagaag tgtcccctcc agtcggtgcg ggtgtcaaca gcaacagctg gacctttaaa 1080 
tacggaccag gcaaccccaa acaatccggt cccggtgagt tgcccgacaa attcattatc 1140 
ccaggatctc ctgcaatcat ctccatccgg caggagccta ctaacagcca aattgacaaa 1200 
agtgacttca taaccttcgg caaaaaggag gagaccaaga aaaagaagaa aaagaagaag 1260 
ggtaacaaga cccaggagaa aaaagagaaa gggaacagca cgactgacaa cagtgaccag 1320 
tgaggtcctc aaatggaaac aagccactta gccagttttt gtaataatgg caaatctctc 1380 
ccatgtagca attccctgct cctttttcct atctacatga gccctcttag agacctcaga 1440 
aatctgcaga aagttccctg tgtctgtcta gaacgcattt aacaggtttt gtcgtaaaag 1500 
ctttactaag tctggtgtta actctttctc tccactctgg cttgttttca gaacctaaaa 1560 
agcagaccca agtttccttt ctcctccgcc gcaaaggaga ggcttcccag ccccgccagt 1620 
gagaggttgg actctctgcc ctgtgctccg gggatcctgt cttgatgaca cttgcagggc 1680 
aggctgaaaa gttttgagat tgagcagctt gggagtttgt ggccactggg tatgtgtggc 1740 
taccgcgggt atgcgagtgc cagatattgg ctgagacgag ccagcttaga ctaattggta 1800 
caaggaaggc aagaaaacaa agacaaataa acagcggaag ttatcagtat ggaggggaag 1860 
tgtaaactta aagggaccag actttctaaa tcttacaact caagaggtgg cagccaccct 1920 
ctaggagaca aaactacccc cactgacaag gctttaggag accctaaagt ctgatggctg 1980 
tgacgtcatt atacctaaaa tctgcatcat acctgcaagc caacagttca gtgttttaac 2040 
agagaaccac cctgggaaac agaagcagat ctgatgtgtt tcctatacat gtcctgtgct 2100 
cactttatta aaaattcttt tgcacacaat gtttatgaaa aggccagatc cttttccaat 2160 
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acttatgcaa aagcaaaaga aaaccccgac acctcacctt tcgctgtttg ttgtttcata 2220 
gatttattta aaaaaagaga aagtctatag ctataaatct ttaaagagaa atatgaatac 2280 
aattccccta aactctcctc aaaagagaat tcagtctaca gccatttaaa tgatcattgc 2340 
tgctacagaa gtgctttaag agaattgcct gaaacatctg tattatatcg gccacctgcc 24 00 
aatcacagct ttactctttc aggtcactct ggggctgcct cttgcatgta ttactaaata 24 60 
aaatgatctc tctttctctc tctctctctc ttttctaaga aacaattatg tgcactttga 2520 
tacacaacct tctctaacca actatatatc aagacccaaa aattgaagaa aaatattgtt 2580 
ttctcataca gtgagcagat ttttcaatct actaattctg tgacttgtct tggtgtgcta 2640 
gcctacacct tctctttggt ttagttttcc ttttctataa cactctgaat tgctaatctt 2700 
actaacacct atgatgttac ctgaaatcaa tctcccatat gtatgctgta tgctatgcta 2760 
agactcctga aatatactta ctctgtgctt gtgtatgtga atgttaatgc aactattacc 2820 
tagagtgaac tttaagcttt attgttgaat gtaattccat tatatttcct tttgtacacc 2880 
tgtgaaaaag tggagtagtg tttttttaac cattgttaat cagcttttgt gtatgaaaga 2940 
cacagtaaaa tttctttctt aaatcaagat actggtgatt caaggaattt tatttatggt 3000 
ccagccaaga gccatctcgt gccaagactt ctgctggcaa gggaatggat aaagctgttt 3060 
tgttctagta acaattttgg aatgaatact gacaatattc catgagggtg tgcaagcaca 3120 
aattttacca atctgacctc tttgaagttg cagaatgctt tgaaattcta atggtatctg 3180 
aaatatcagc tcatagaaag taacaaaatt tgctgtcacc ttaaataaga cattttaatt 3240 
ttgttataat gtacaattta gaagtttgat taattatatt atctatttag gcattaatat 3300 
aaaagaggta ggagtctgtt atttaaaaaa agcattaaat ttaaaaaaaa actgtcttgt 3360 
ctacttttag cttcattctc ccatattttg aagggtgtgt aacttcagct ctgcaggatt 3420 
gccatggggt aaaacttgtt acccaacaca tgtgaaccat tgcctacatt gtaggttgtg 3480 
atcattttgc cccactgaag cccatgtatc tgaccttacg tgccttttga actaggagaa 3540 
tcgggctaat ttattaatga tgataattat aatgtatctg tacagcactt tttacatttg 3600 
cgaagtgctt tccaatccat gttagttact agttattaca gctgtaagga taaaacacgt 3660 
catgtggatt cattttgaat tggtgctatt ggtatttcct ctgttattgc taataaatga 3720 
aaatggtggt atgaaaaaaa aaaaaaaa 3748 

<210> 102 
<211> 436 
<212> PRT 

<213> Homo sapiens 



<400> 102 



Met 


Asn 


Asp 


His 


Ala 


Pro 


His 


He 


Leu 


Tyr 


Pro 


Thr 


Ser 


Thr 


Asn 


Ser 


1 








5 










10 










15 




Ser 


Ala 


Ala 


Phe 


Glu 


Met 


Val 


Pro 


Arg 


Thr 


Ala 


Pro 


Ala 


Gly 


Tyr 


Leu 








20 










25 










30 




Val 


Thr 


Lys 


Val 


He 


Ala 


Met 


Asp 


Ser 


Asp 


Ser 


Gly 


Gin 


Asn 


Ala 


Trp 






35 










40 










45 






Leu 


Phe 


Tyr 


His 


Leu 


Ala 


Gin 


Thr 


Ser 


Asp 


Leu 


Asp 


Leu 


Phe 


Lys 


Val 




50 










55 










60 








Glu 


Leu 


His 


Thr 


Gly 


Glu 


He 


Arg 


Thr 


Thr 


Arg 


Lys 


Met 


Gly 


Asp 


Glu 


65 










70 










75 








80 


Ser Gly 


Ser 


Thr 


Phe 


Asn 


Leu 


Thr 


Val 


Val 


Val 


Arg 


Asp 


Asn 


Gly 


Glu 










85 










90 










95 




Pro 


Ser 


Leu 


Ser 


Ala 


Ser 


Val 


Ala 


He 


Thr 


Val 


Ala 


Val 


Val 


Asp 


Arg 








100 










105 










110 


Val 


Ser 


Lys 


He 


Leu 


Pro 


Asp 


Thr 


Gin 


Arg 


His 


Val 


Lys 


Ser 


Pro 


Arg 






115 










120 










125 






Thr 


Tyr 
130 


Ser 


Glu 


He 


Thr 


Leu 
135 


Tyr 


Leu 


He 


He 


Ala 
140 


Leu 


Ser 


Thr 


Val 


Ser 


Phe 


He 


Phe 


Leu 


Leu 


Thr 


He 


He 


He 


Leu 


Ser 


He 


He 


Lys 


Cys 


145 










150 










155 








160 


Tyr Arg 


Tyr 


Thr 


Ala 


Tyr 


Gly 


Thr 


Ala 


Cys 


Cys 


Gly 


Gly 


Phe 


Cys 


Gly 










165 










170 










175 


Val 


Arg 


Glu 


Arg 
180 


Ser 


Pro 


Ala 


Glu 


Leu 
185 


Tyr 


Lys 


Gin 


Ala 


Asn 
190 


Asn 


Asn 


He 


Asp 


Ala 


Arg 


He 


Pro 


His 


Gly 


Leu 


Lys 


Val 


Gin 


Pro 


His 


Phe 


He 
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195 




Glu 


Val 


Arg 


Gly 




210 






Cys 


Leu 


Thr 

X 1 11. 




225 








Gly Ala 


Gin 


Thr 


Asp 


Ser 


Arg 


As n 








c ou 


lie 


He 


Leu 


Lys 






275 




Asn 


Pro 


Asp 


irp 




290 






Ser 


Val 




Jut; LI 


305 








Pro Asp 






Ala 


Gly 




Vox 










Trp 


Thr 


Phe 


Lys 






355 




Glu 


Leu 








370 






lie 


Arg 




mil 


385 








Thr 


Phe 


Gly 


Lys 


Gly Asn 


Lys 


Thr 








420 


Asn 


Ser 


Asp 


Gin 






435 





200 

Asn Gly Ser Leu 

215 

Gly Ser Gly Ser 
230 

Gly Pro Gly Pro 
245 

Leu Thr Gly Gin 

Asn Glu Ala Val 
280 

Arg Tyr Ser Ala 

295 

Glu Glu Ala Gly 
310 

Trp Pro Thr Val 
325 

Ser Pro Pro Val 

Tyr Gly Pro Gly 

360 

Lys Phe, He He 
375 

Pro Thr Asn Ser 
390 

Lys Glu Glu Thr 
405 

Gin Glu Lys Lys 



1 nr 


Lys 


Thr Tyr 






220 


Asp 


Thr 


Phe Met 








oer 


uxy 


Aj.a oxn 




9Rn 




oer 


V3±y 


oJ.n Asn 


O C c: 






oer 


lain 


Asn Glu 


Ser 


Leu 


TV 1 

Arg Ala 






300 


He 


Leu 


Arg Ala 








oer 


Ser 


TV 1 a TKy 

/\±a 1 nr 








oJ.y 


Aia 


vjiy vai 








Asn 


• 

Pro 


ijys bxn 


Pro 


Gly 


Ser Pro 






380 


Gin 


He 


Asp Lys 






395 


Lys 


Lys 


Lys Lys 




410 




Glu 


Lys 


Gly Asn 


425 







205 



Cys 


Tyr 


Lys 


TV 1 

Ala 


Phe 


Tyr 


Asn 


Thr 








240 


Ala 


Ala 


Val 


Thr 






O R Q 




Aia 


biy 


TV Mao 

Asn 


Leu 




o T rv 
270 






rro 


Arg 


Gin 


Pro 


0 Q t 








biy 


Met 


His 


Ser 


biy 


Pro 


Gly 


Giy 








320 


Pro 


Glu 


Pro 


Glu 










Asn 


Ser 


7\ M M 

Asn 


O M w 

Ser 










Ser 


Gly 


Pro 


Gly 










Ala 


He 


He 


Ser 


Ser 


Asp 


Phe 


He 








400 


Lys 


Lys 


Lys 


Lys 






415 




Ser 


Thr 


Thr 


Asp 




430 







<210> 103 
<211> 2429 

<212> DNA 

<213> Homo sapiens 



<400> 103 

gtcgacccac 

ggtgcgcccg 

gttgcggcga 

agctgaccat 

gcctgatgaa 

gcatgttcga 

tcacttctac 

tgtgatgtgc 

tggaacttcc 

ctacggaagt 

cccaagagca 

ggtgatcttt 

aagtgtgata 

tataatggga 

tgccctctgc 

gtatcactac 

tattgtggct 

gtatgacaag 

tgtcgaggag 

tgactatgtg 



gcgtccgcgg 
ccaggtgagc 
gcggattggt 
tgacaatcag 
ttcaaaccga 
ccaatgctct 
aaatggacct 
attgccatct 
cttttaggag 
ggctatggct 
gcaaagggct 
gttaccagtg 
atagtgagtg 
gtgaacccaa 
aaccaatttt 
tgtgttgtgg 
tttgctttaa 
tccaatattt 
tgggttaaaa 
gaaagagttg 



acgcgtgggc 
gcgctccctg 
ttatcttgga 
ccatgtcatc 
atcattatgc 
ctcagccagc 
ctcctccagg 
ttgcctgtgt 
gtagtgtagg 
atggctatgg 
tcatgttggc 
ttataagatc 
ctatcctggg 
ctgctcagtc 
atacacctgc 
atccccagga 
taattttctt 
tgtgggacaa 
atgtgtctgc 
acagtcccat 



ggctgagcgc 
gcaccgttgg 
agctaaaggg 
caggcctctt 
accaagcaat 
ctactctttt 
agtgattcgg 
ggcctccacg 
ctacccttat 
ttatggctat 
catggctgcc 
tgaaatgtcc 
catcatggtg 
ttctggatct 
agctactgga 
ggccattgcc 
tgctgtgaaa 
ggaacacatt 
aggcacacag 
ggcatactct 



tggcggtcgg 
cccccggagg 
cattgctcat 
gaaagtccac 
gacatatatg 
tacccagaag 
atcctgtcta 
cttgcctggg 
ggaggaagtg 
ggctacggag 
ttttgtttca 
agaacaagaa 
tttattgcca 
ctatatggtt 
ctctacgtgg 
attgtactgg 
actcgaagaa 
tatgatgagc 
gacgtgcctt 
tccaatggca 



tgcggcgtca 
gtcgggccca 
cctgaagatc 
ctccttacag 
gtggagagat 
atgaaattct 
tgctcattat 
acagaggcta 
gctttggtag 
gctatacaga 
ttgccgcgtt 
gatactactt 
caattgtcta 
cacaaatata 
atcagtattt 
ggttcatgat 
agatggacag 
agccccccaa 
cacccccatc 
aagtgaatga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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caagcggttt tatccagagt cttcctataa atccacgccg gttcctgaag tggttcagga 1260 

gcttccatta acttcgcctg tggatgactt caggcagcct cgttacagca gcggtggtaa 1320 

ctttgagaca ccttcaaaaa gagcacctgc aaagggaaga gcaggaaggt caaagagaac 1380 

agagcaagat cactatgaga cagactacac aactggcggc gagtcctgtg atgagctgga 14 4 0 

ggaggactgg atcagggaat atccacctat cacttcagat caacaaagac aactgtacaa 1500 

gaggaatttt gacactggcc tacaggaata caagagctta caatcagaac ttgatgagat 1560 

caataaagaa ctctcccgtt tggataaaga attggatgac tatagagaag aaagtgaaga 1620 

gtacatggct gctgctgatg aatacaatag actgaagcaa gtgaagggat ctgcagatta 1680 

caaaagtaag aagaatcatt gcaagcagtt aaagagcaaa ttgtcacaca tcaagaagat 1740 

ggttggagac tatgatagac agaaaacata gaaggctgat gccaagttgt ttgagaaatt 1800 

aagtatctga catctctgca atcttctcag aaggcaaatg actttggacc ataaccccgg 1860 

aagccaaacc tctgtgagca tcacaaagtt ttggttgctt taacatcatc agtattgaag 1920 

cattttataa atcgcttttg ataatcaact gggctgaaca ctccaattaa ggattttatg 1980 

ctttaaacat tggttcttgt attaagaatg aaatactgtt tgaggttttt aagccttaaa 2040 

ggaaggttct ggtgtgaact aaactttcac accccagacg atgtcttcat acctacatgt 2100 

atttgtttgc ataggtgatc tcatttaatc ctctcaacca cctttcagat aactgttatt 2160 

tataatcact tttttccaca taaggaaact gggttcctgc aatgaagtct ctgaagtgaa 2220 

actgcttgtt tcctagcaca cacttttggt taagtctgtt ttatgacttc attaataata 2280 

aattccctgg cctttcatat tttagctact atatatgtga tgatctacca gcctccctat 2340 

•tttttttctg ttatataaat ggttaaaaga ggtttttctt aaataataaa gatcatgtaa 2400 

aagtaaaaaa aaaaaaaaag ggcggccgc 2429 

<210> 104 
<211> 522 
<212> PRT 

<213> Homo sapiens 

<400> 104 

Met Ser Ser Arg Pro Leu Glu Ser Pro Pro Pro Tyr Arg Pro Asp Glu 

1 5. 10 15 

Phe Lys Pro Asn His Tyr Ala Pro Ser Asn Asp lie Tyr Gly Gly Glu 

20 25 30 

Met His Val Arg Pro Met Leu Ser Gin Pro Ala Tyr Ser Phe Tyr Pro 

35 . 40 45 

Glu Asp Glu lie Leu His Phe Tyr Lys Trp Thr Ser Pro Pro Gly Val 

50 55 60 

lie Arg He Leu Ser Met Leu He He Val Met Cys He Ala He Phe 
65 70 75 80 

Ala Cys Val Ala Ser Thr Leu Ala Trp Asp Arg Gly Tyr Gly Thr Ser 

85 90 95 

Leu Leu Gly Gly Ser Val Gly Tyr Pro Tyr Gly Gly Ser Gly Phe Gly 

100 105 110 

Ser Tyr Gly Ser Gly Tyr Gly Tyr Gly Tyr Gly Tyr Gly Tyr Gly Tyr 

115 120 125 

Gly Gly Tyr Thr Asp Pro Arg Ala Ala Lys Gly Phe Met Leu Ala Met 

130 135 140 

Ala Ala Phe Cys Phe He Ala Ala Leu Val He Phe Val Thr Ser Val 
145 150 155 160 

He Arg Ser Glu Met Ser Arg Thr Arg Arg Tyr Tyr Leu Ser Val He 

165 170 175 

He Val Ser Ala He Leu Gly He Met Val Phe He Ala Thr He Val 

180 185 190 

Tyr He Met Gly Val Asn Pro Thr Ala Gin Ser Ser Gly Ser Leu Tyr 

195 200 205 

Gly Ser Gin He Tyr Ala Leu Cys Asn Gin Phe Tyr Thr Pro Ala Ala 

210 215 220 

Thr Gly Leu Tyr Val Asp Gin Tyr Leu Tyr His Tyr Cys Val Val Asp 
225 230 235 240 

Pro Gin Glu Ala He Ala He Val Leu Gly Phe Met He He Val Ala 
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245 250 255 



IT lie 




Leu 


He 


Tie 

X xc 


Php 

IT i IC 


Ph^ 

IT 1 IC 


Al 
r^x o 


Val 
V ax 




Thr Arg Arg 


Lys 


Met 


Asp 








260 










265 










270 








T\/T" 

lyr 


Asp 


T.\7Q 

jjy o 


OCX 


A CD 


Tip 

X X c 


T .A1 1 
iJC Li 


. 1 xp 




Lys 


Glu 


His 


He 


Tyr 


Asp 






?7S 










PRO 
^ o 










285 






(Z^ n 


Gin 






A cn 


vox 


Glu 


f5 1 11 
vsx u 


1 rp 


Val 
vox 


Lys 


Asn 


Val 


Ser 


AX a 


biy 




290 










295 










300 








i fii 


1 1 




Vox 




OCX 




i X W 


OCX 


A <sr^ 


Tyr 


Val 


Glu 


Arg 


val 


Asp 












X u 










315 














Pro 






lyr 


OCX 


OCX 


A 

noil 


fil V 


Xiyo 


Val Asn Asp Lys 


Arg 


Fne 










325 










330 










J J 0 




lyr 






Q A V 

OCX 


Q A V 
OCX 


lyr 


Xjy o 


O o T- 

OcX 


1 1 IX 


Pro 

t X u 


V a± 


Pro 


Glu 


val 


val 


bin 








34 0 










34 5 










350 






Glu 


Leu 




Leu 


1 llX 


Ser 


Pro 


1 

V dx 


ZV e r> 


A 

ribp 


Phe 


Arg 


Gin 


Pro 


Arg 


Tyr 






355 










360 










365 








Ser 


Ser 




vaxy 


Asn 


Phe 


Glu 


1 1 Ix 


D v" 

tr X O 


C A V 
OCX 


Lys 


Arg Ala 


rro 


Ala 


Lys 




370 










375 










380 










Gly 


Arg 


ft 1 a 


vjxy 


Arg 


Ser 


Lys 


Arg 


1 liX 


uxU 


Gin 


Asp 


His 


Tyr 


blU 


rp v» V 

1 nr 


385 










390 










395 












Asp Tyr 


Th-r 
1 III. 


Th r- 


^axy 


Gly Glu 


Ser 


Cys 


Asp 


Glu 


Leu 


Glu 


Glu 


Asp 


Trp 










405 










4 1 n 

•a X V 










HID 




He 


Arg 


mil 

U J. Li 


lyr 


tr i O 


Pro 


He 


Thr 


Q /-» V 
OCX 


Asp 


Gin 


Gin 


Arg 


Gin 


Leu 


Tyr 


















4 9 S 

ri ^ J 










430 






Lys 


Arg 




IT 1 IC 


Asp 


Thr Gly Leu 


o±n 


u±U 


Tyr 


Lys 


Ser 


Leu 


bin 


Ser 






435 










440 










445 








Glu 


Leu 


Asp 


uXU 


ixe 


Asn 


Lys 


Glu 


Leu 


oer 


Arg 


Leu 


Asp 


Lys 


Glu 


Leu 




450 










455 










460 










Asp 


Asp 


Tyr 


Arg 


Glu 


Glu 


Ser 


Glu 


Glu 


Tyr 


Met 


Ala 


Ala 


Ala 


Asp 


Glu 


465 










470 










475 










480 


Tyr 


Asn 


Arg 


Leu 


Lys 


Gin 


Val 


Lys 


Gly 


Ser 


Ala Asp Tyr 


Lys 


Ser 


Lys 










485 










490 










495 




Lys 


Asn 


His 


Cys 


Lys 


Gin 


Leu 


Lys 


Ser 


Lys 


Leu 


Ser 


His 


He 


Lys 


Lys 








500 










505 










510 






Met 


Val 


Gly 


Asp 


Tyr 


Asp Arg 


Gin 


Lys 


Thr 















515 520 



<210> 105 

<211> 2985 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 15 

<223> n = A,T,C or G 
<400> 105 

cccgcgtccg gattnttttg ataagtacgg gacgttattg atggtagaga atatttttat 60 

agactgttgg agttggaagt cttcttgctt ggcattaagg aagagactga taaaggtaat 120 

gggtgtgaac cactcttaaa gatgtaggca atgaaaaaat agcctggaga gagaacaagg 180 

ccaagtgaag tttgtcattc cccacctccc cccaccctcc atcttccaaa ccaaggagaa 240 

ggagccagtg gagaacaaag gagcttaagg aacattcgag taaagttcct caagattcag 300 

tagtagatct taaaaatgaa atgtattagg atatattaca tatggactgt ttctataata 360 

tactcttcct ttctcctctc agctttgaca tctttatata atagcatgat attttactta 420 

catatatctt taaaaaatca ttctatagga gtgtccctag ttgtaacaga aactgtcgat 480 

gcaggtttat ttggagaagg attggggaga gttttgattc atgcatggga gcatttactt 540 

ttacagccaa agaccaaagg tgaaagtgct aattgtgaaa agtatgggaa agttatacca 600 

gcaagtgctg ttatatttgg gatggcagta gaatgtgcag agataagaag acatcataga 660 
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gtgggtatta 
gcttattctt 

cgagttcctg 
aacaaccttc 
gttattaaag 
gaacatagtt 
gtacaggacc 
tgtcaagact 
gatgttgcaa 
ctttgtagtg 
atattggaat 
atggcaatga 
actcactgtc 
cttgaacctt 
ggcacacttg 
ctggacctca 
catgtggagg 
attatattgg 
gcaatatcca 
tttgggttgc 
gtactcttgg 
ttttggacct 
tagatttaat 
aatctggaag 
ggttggatac 
ccatccaggt 
tcagcatcac 
tcaacaagta 
tcaaatattt 
aaaamgsayk 
agttttttaa 
ttatcgttat 
ttttcttaaa 
gttatggtta 
catctcaggt 
gtaattaaaa 
ctactttggc 
cttaatccta 
tccattttta 



aggacattgc 
ctgtagatac 
gaggatattt 
aggaattaaa 
aaggcatact 
tatccacaag 
ttgcagacta 
gttacttaag 
aaagttttac 
ccctcgctaa 
ctacagaaat 
gcaaagtttt 
agaatgagat 
tctacgtgtt 
ggcaggttaa 
tatctgaagc 
atgaatctgc 
atgtcattga 
ggtgtttact 
ttgtggaatc 
attaaatata 
atcctcattg 
gactccgtat 
actgactgct 
ttcaagtgaa 
tgaaatggga 
agtaaagcaa 
caaaattctt 
tttaaaattg 
mkkrkmgamw 
attaggaaac 
tgacatgtgt 
tccaaacttg 
cctgtaaatt 
tttgttttga 
cagtaaagga 
aaaagaacat 
tactattttt 
caaacctttt 



tggtatccat 
tgaagaaaca 
ggctttgaca 
aagtcttgca 
agatgctatt 
tcctagtgag 
ctggataaag 
aatccagagt 
ccagaataaa 
ccttcagacc 
tgctttgctt 
gtcttcactg 
gagctctgga 
agatgtgtcc 
accatacagt 
caatcacttt 
tatgttacaa 
gccatctggg 
acaatctgga 
acagacactc 
gcacctttta 
ccctgtatac 
ttgaacacct 
attccatttt 
gcctcccact 
gaggaacttg 
tgaagagcag 
gtcttaatta 
acattaataa 
ymctgctgca 
ttcttttgct 
aagtaaagca 
taattgggaa 
ccaatcttgc 
gaatgcaagc 
gttagggaat 
ttaaaagctg 
caaagttttg 
gaataaggag 



ttgccaacaa 
attgaacctt 
gagtgctttg 
actaaaaagc 
atggtttggt 
gaaacatgtt 
cctggagacc 
attagtgtct 
gacttgttat 
agtaaaccag 
aacaacatcc 
actccagaga 
actggacaga 
gaaggcttct 
tctgtggaga 
cctaaagaaa 
aggccaaaat 
ctcattcagc 
ggcaagatct 
ctagaggaga 
ttaaccagtt 
ctttaagcaa 
ctaacagaga 
ggtatcatat 
ggaaacaagc 
tactcagcat 
ttttccaatg 
gtgggggtat 
agcatatt tt 
gatttgcttt 
cgattttaca 
aaactttact 
acactgaaag 
ttcctttaaa 
ttgaaaaaga 
aaatcttcag 
gtaacaaaac 
atttggatgt 
atcataatat 



atgtgaaatt 
atacaactga 
aaattatgac 
ctgataagat 
ttgtgctcca 
gggaacaggc 
atgtgatgat 
tgggtttgga 
cgttaggaaa 
atgctgtaga 
catatcatga 
aactgtatca 
gtaatactgt 
ctgttctgcc 
aagaccagca 
cacttgagtt 
cagacaagtt 
aggaaataat 
ttcctcagta 
atgctgttca 
tcaggtacct 
gccagtggaa 
agtaaaggta 
gtaccttgat 
tgcagttgtt 
tcagcatcac 
aaaactgtgt 
ataaaaattc 
aaaagattct 
ctggaaaagg 
gaatagggat 
tttgtaggca 
gcttccactg 
tactcagtgt 
atttaagcta 
gaggcagcat 
aaagttaaat 
acaataagta 
gcctc 



tcagagtccg 
aaagatgagt 
agtagatttc 
tggtattcct 
gcttgatgat 
tgtctacccc 
ggaagtatct 
atgtgaaatg 
tgaggctgaa 
gcagacatgt 
aggctttaaa 
gaccatggat 
acagaacatc 
tgttattgct 
tcgtattgct 
ttggctgaga 
atggagcata 
ggaaaaagct 
tgtgctgatg 
aggaacagaa 
atacgtgtat 
ctcttaagac 
tacgtttgta 
gaagagatta 
ttagataatc 
aaaagcaatg 
aaatagagca 
cttgtaatgg 
aaaaaaaaaa 
atacatcact 
tttaaaagtc 
tcttggcctt 
aagactgagg 
acatctgaaa 
taagctaaat 
ttttcttggt 
tgaaggaaga 
cattaattga 



720 
780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

2985 



<210> 106 
<211> 519 
<212> PRT 

<213> Homo sapiens 



<400> 106 

Met Lys Cys He Arg lie Tyr Tyr lie Trp Thr Val Ser He lie Tyr 

15 10 15 

Ser Ser Phe Leu Leu Ser Ala Leu Thr Ser Leu Tyr Asn Ser Met He 

20 25 30 

Phe Tyr Leu His He Ser Leu Lys Asn His Ser He Giy Val Ser Leu 

35 40 45 

Val Val Thr Giu Thr Val Asp Aia Giy Leu Phe Giy Glu Giy Leu Giy 

50 55 60 

Arg Val Leu lie His Aia Trp Giu His Leu Leu Leu Gin Pro Lys Thr 
65 70 75 80 

Lys Giy Glu Ser Ala Asn Cys Glu Lys Tyr Giy Lys Val He Pro Ala 

85 90 95 

Ser Aia Val lie Phe Giy Met Aia Vai Giu Cys Aia Giu He Arg Arg 



wo 03/000012 



133 



PCT/US02/19773 



100 



U -1 e Hi*; 




vox 


ox y 


Tl a 
XXc 


Lys 


rtop 




ii5 










1 90 

X ^ u 


a c n Ma "I 


Ly s 


t 1 IC 


m n 

OXll 


C 0 -r 
OcX 


Pro 


Ala 


130 

X ~J \J 










1 3^ 




Thr T 1 


Glu 


t X W 


lyr 


1 IIX 


Thr 

X J iX 


ox u 


X 4 








X J V 








nxa 


_ 

Leu 


i. Iix 


V3 X Ll 


v^yo 


Ph^ 
1 1 Ic 








X O 








noli JJC LL 




V9X U 


Leu 


Lys 


Qar 
OCX 


Leu 






180 










vjx y i X ^ 




vox 


X X c 


T \7Q 

Xty 0 


0 X u 


ox y 




195 










200 


phpk 1 
triic; vczx 


Leu 




Leu 


21 CO 
riap 


rxo p 


ox u 


210 










215 




ox u ox u 


Thr 

1 ILL 


wys 


irp 


ox u 


rji n 

OXll 


Al ;5 
nX o 


Z ^ ^ 








9 "^n 






Asp i yx 


irp 


T 1 o 
XXc 


Lys 


IT X (J 


ox y 


Asp 
















uXtl rio^ 


\^ys 


lyr 


Leu 


riX y 


XXc 


CI n 
OXli 






D U 










P'v/c Pin 

^»^yb oXU 


Maf- 


riSp 


V dX 


a 1 a 
rAXd 


Lys 


oe i 




^ 1 o 










9ft n 
^ 0 u 




Lixy 


Asn 


kjXU 


Mxa 


oXU 


Leu 
















I riL OCX 


Lys 


D V 

rx O 


Asp 


x-iXa 


va X 


oXU 










"5 1 n 
J iU 








AXo 


Leu 


Lsu 


Asn 


Asn 


T 1 a 

J .Le 












* 




TV 1 a Ma-t- 


Ser 


Lys 


vai 


Leu 


Ser 


Ser 






J *a U 










i IlX ITICS L 


rtop 


I lix 


Hie 

n X 0 


L^y 5 


Pin 

oxn 


A 0 




O ^ U 












Q o V & e n 

OCX risn 


1 liX 


VdX 


oxn 


Asn 


T 1 a 
XXc 


Leu 












37*1 
J f •J 




Ocrx UXLI 


oxy 


Pho 

Iriic 


OCX 


V a X 


Leu 


Jrx 0 


3ft S 








3Qn 






V a X Xjy o 


It X U 


lyr 


^a r- 

OCX 


OCX 


V aX 


oXu 








H\JD 








ZL cr^ Tail 


X Xc 


O o y 

OCX 


uX u 




As n 


nx s 






420 










i. X ^ XjC u 


7\ >- n 

•rtx y 


n X o 


V d X 


OX u 


rtSp 


OXU 




•a J J 










4 4 0 
H fl U 


OCX nop 


Lys 


Leu 


irp 


OCX 


T 1 a 
XXc 


T 1 a 
XXc 


450 










455 




Gly Leu 


He 


Gin 


Gin 


Glu 


He 


Met 


465 








470 






Leu Leu 


Gin 


Ser 


Gly 


Gly 


Lys 


He 








485 








Gly Leu 


Leu 


Val 


Glu 


Ser 


Gin 


Thr 






500 










Gly Thr 


Glu 


Val 


Leu 


Leu 


Asp 





515 



<210> 107 

<211> 2467 

<212> DNA 

<213> Homo sapiens 



105 110 



Tl fa 
XXc 


AT A 


ox y 


He 


His 


Leu 


Pro 


Th -K- 

i nr 










125 








Tvr 
lyr 


OCX 


Car 

OCX 


Val 


Asp Thr 


Pill 


P1 11 

oXU 








140 










Lys 


Met 


C Ar 

OCX 


Arg Val 


Pro 


oxy 


<^1 \7 

oxy 






X 3 










lou 


Glu 


Hp 

XXC 


l IC L 


Th 

1 nr 


val 


TV nn 

Asp 


Pha 

rrne 


Asn 




170 

X / w 










1 7 R 
X / o 




Al A 
AX CI 


Thr 

X 1 IX 


Lys 


Lys 


Pro 


Asp 


Lys 


T 1 a 

1 j.e 


1 ft s 
X 0 «^ 










lyu 






XXC 


T .a 1 1 
Xjc u 


A c 
r\a p 


Ala 


He 


iyiet 


vax 


Trp 










0 n t 








nx o 


Car 
OCX 


Leu 


oer 


rp U, ^ 

inr 


oer 


Pro 


Ca T- 

oer 








0 0 n 










Va 1 
V d X 


T\f T* 

iyr 


Pro 

Erx 0 


Val 


Gin 


Asp 


Leu 


Ala 

AX a 






9 

^ J 0 










9 / n 


nxo 


1 

Vox 


Mat* 


Met 


Glu 


vax 


Ser 


Cys 




9=in 














Q a "T* 

OCX 


T 1 a 
XXc 


Car 

OCX 


Val 


Leu 


PI** 
oiy 


Leu 


PT 11 
oXU 


265 










0 T n 






Ph a 


i nx 


Pin 

oxn 


Asn 


Lys 


Asp 


Leu 


Leu 










285 








v-»y S 


Ser 


AX a 


Leu 


Ala 


Asn 


Leu 


oxn 








300 










oxn 


inr 


Cys 


He 


Leu 


Glu 


Ser 


TK V 

i nr 






J i-D 












Pro 


Tyr 


nis 


Glu Gly 


Phe 


Lys 


Met 




ooU 










0 jD 




Leu 


Thv 

i nr 


Pro 


Glu 


Lys 


Leu 


Tyr 


PI r% 

oxn 


34 s 










350 






Pill 

oX U 


Me L. 


Ser 


Ser 


Gly 


Thr 


biy 


bin 










365 








Pin 
oXU 


fro 


Dh a 


Tyr 


Val 


Leu 


Asp 


vai 








380 










vax 


T 1 a 

xxe 


Ala 
AX a 


Gly Thr Leu 


PI \w 

ijxy 


oin 






J ^ J 










H uu 


Lys 


Asp 


oxn 


His 


Arg 


He 


Zll =» 

AX a 


Leu 




410 










4 1 S 

*3 X «^ 




Ph^ 


Pro 

Er X vj 


Lys 


Glu 


Thr 


Leu 


oXU 


Pha 
rHc 


^ 0 










430 




■ 


Qa 
OCX 


Al ^ 
rtx a 


1*1 c L 


Leu 


Gin 


Arg 


irx 0 


Lys 










445 








T 1 a 

lie 


Leu 


Asp 


Val 


He 


Glu 


Pro 


Ser 








4 60 










Glu 


Lys 


Ala 


Ala 


He 


Ser 


Arg 


Cys 






475 










480 


Phe 


Pro 


Gin 


Tyr 


Val 


Leu 


Met 


Phe 




490 










495 




Leu 


Leu 


Glu 


Glu 


Asn 


Ala 


Val 


Gin 


505 










510 
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<220> 

<221> misc_feature 

<222> 2453, 2454, 2455, 2456, 2457, 2458, 2459, 2460, 2461, 2462, 
2463, 2464, 2465, 2466, 2467 
<223> n = A,T,C or G 

<400> 107 

cgaccacgcg tccgcccgca gcggccgagc tgcagcccgg gctcagtctc cgccgccgcc 60 
gtgaacatgg agcccccgga cgcaccggcc caggcgcgcg gggccccgcg gctgctgttg 120 
ctcgcagtcc tgctggcggc gcacccagat gcccaggcgg aggtgcgctt gtctgtaccc 180 
ccgctggtgg aggtgatgcg aggaaagtct gtcattctgg actgcacccc tacgggaacc 240 
cacgaccatt atatgctgga atggttcctt accgaccgct cgggagctcg cccccgccta 300 
gcctcggctg agatgcaggg ctctgagctc caggtcacaa tgcacgacac ccggggccgc 360 
agtcccccat accagctgga ctcccagggg cgcctggtgc tggctgaggc ccaggtgggc 420 
gacgagcgag actacgtgtg cgtggtgagg gcaggggcgg caggcactgc tgaggccact 4 80 
gcgcggctca acgtgtttgc aaagccagag gccactgagg tctcccccaa caaagggaca 54 0 
ctgtctgtga tggaggactc tgcccaggag atcgccacct gcaacagccg gaacgggaac 600 
ccggccccca agatcacgtg gtatcgcaac gggcagcgcc tggaggtgcc cgtagagatg 660 
aacccagagg gctacatgac cagccgcacg gtccgggagg cctcgggcct gctctccctc 720 
accagcaccc tctacctgcg gctccgcaag gatgaccgag acgccagctt ccactgcgcc 780 
gcccactaca gcctgcccga gggccgccac ggccgcctgg acagccccac cttccacctc 84 0 
accctgcact atcccacgga gcacgtgcag ttctgggtgg gcagcccgtc caccccagca 900 
ggctgggtac gcgagggtga cactgtccag ctgctctgcc ggggggacgg cagccccagc 960 
ccggagtata cgcttttccg ccttcaggat gagcaggagg aagtgctgaa tgtgaatctc 1020 
gaggggaact tgaccctgga gggagtgacc cggggccaga gcgggaccta tggctgcaga 1080 
gtggaggatt acgacgcggc agatgacgtg cagctctcca agacgctgga gctgcgcgtg 1140 
gcctatctgg accccctgga gctcagcgag gggaaggtgc tttccttacc tctaaacagc 1200 
agtgcagtcg tgaactgctc cgtgcacggc ctgcccaccc ctgccctacg ctggaccaag 1260 
gactccactc ccctgggcga tggccccatg. ctgtcgctca gttctatcac cttcgattcc 1320 
aatggcacct acgtatgtga ggcctccctg cccacagtcc cggtcctcag ccgcacccag 1380 
aacttcacgc tgctggtcca aggctcgcca gagctaaaga cagcggaaat agagcccaag 14 40 
gcagatggca gctggaggga aggagacgaa gtcacactca tctgctctgc ccgcggccat 1500 
ccagacccca aactcagctg gagccaattg gggggcagcc ccgcagagcc aatccccgga 1560 
cggcagggtt gggtgagcag ctctctgacc ctgaaagtga ccagcgccct gagccgcgat 1620 
ggcatctcct gtgaagcctc caacccccac gggaacaagc gccatgtctt ccacttcggc 1680 
gccgtgagcc cccagacctc ccaggctgga gtggccgtca tggccgtggc cgtcagcgtg 1740 
ggcctcctgc tcctcgtcgt tgctgtcttc tactgcgtga gacgcaaagg gggcccctgc 1800 
tgccgccagc ggcgggagaa gggggctccg ccgccagggg agccagggct gagccactcg 1860 
gggtcggagc aaccagagca gaccggcctt ctcatgggag gtgcctccgg aggagccagg 1920 
ggtggcagcg ggggcttcgg agacgagtgc tgagccaaga acctcctaga ggctgtccct 1980 
ggacctggag ctgcaggcat cagagaacca gccctgctca cgccatgccc gcccccgcct 204 0 
tccctcttcc ctcttccctc tccctgccca gccctccctt ccttcctctg ccggcaaggc 2100 
agggacccac agtggctgcc tgcctccggg agggaaggag agggagggtg ggtgggtggg 2160 
agggggcctt cctccaggga atgtgactct cccaggcccc agaatagctc ctggacccaa 2220 
gcccaaggcc cagcctggga caaggctccg agggtcggct ggccggagct atttttacct 2280 
cccgcctccc ctgctggtcc ccccacctga cgtcttgctg cagagtctga cactggattc 2340 
ccccccctca ccccgcccct ggtcccactc ctgcccccgc cctacctccg ccccacccca 24 00 
tcatctgtgg acactggagt ctggaataaa tgctgtttgt cacatcaaca ccnnnnnnnn 24 60 
nnnnnnn 24 67 



<210> 108 

<211> 628 

<212> PRT 

<213> Homo sapiens 



<400> 108 

Met Glu Pro Pro Asp Ala Pro Ala Gin Ala Arg Gly Ala Pro Arg Leu 
15 10 15 
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L6U 


Leu 


Leu 


Ala 


Val 


Leu 


Leu 


Ala 








20 










Val 


Arg 


Leu 


Ser 


Val 


Pro 


Pro 


Leu 






35 










40 


Val 


lie 


Leu 


Asp 


Cys 


Thr 


Pro 


Thr 




50 










55 




Glu 


Tro 


Phe 


Leu 


Thr 


Asp 


Arg 


Ser 


65 










70 






Ala 


Glu 


Met 


Gin 


Glv 


Ser 


Glu 


Leu 










85 








Glv 


Arg 


Ser 


Pro 


Pro 


Tvr 


Gin 


Leu 








100 










Ala 


Glu 


Ala 


Gin 


Val 


Gly 


Asp 


Glu 






115 










120 


Ala 


Glv 


Ala 


Ala 


Glv 


Thr 


Ala 


Glu 




130 










135 




Ala 


Ly s 


Pro 


Glu 


Ala 


Thr 


Glu 


Val 


145 










150 






Val 


Met 


Glu 


Asp 


Ser 


Ala 


Gin 


Glu 










165 








Glv 

w J. jr 


Asn 


Pro 


Ala 


Pro 


Lys 


lie 


Thr 








180 










Glu 


Val 


Pro 


Val 


Glu 


Met 


Asn 


Pro 






195 










200 


Val 


Arg 


Glu 


Ala 


Ser 


\3X y 


XjC Li 


T .oil 




210 










215 




Airg 


Leu 


Arg 


Lys 


Asp 


Asp 


Arg 


Asp 


225 










230 






Tvr 


Ser 


Leu 


Pro 


Glu 


Gly 


Arg 


His 










24 5 








His 


Leu 


Thr 


Leu 


His 


Tvr 
X y i. 


Pro 


Thr 








260 










Ser 


Pro 


Ser 


Thr 


Pro 


Ala 


Gly 


T m 
i rp 






275 










c> o u 




Leu 


Cys 


Arg 


Glv 


Asp 


Gly 


Ser 




290 










^ mJ 




Arcf 


Leu 


Gin 


Asp 


Glu 


Gin 


Glu 


Glu 


305 










310 






Asn 


Leu 


Thr 


Leu 


Glu 


Glv 


Val 


Thr 










325 








Cys 


Arg 


Val 


Glu 


Asp 


Tvr 

1 y i 


Asp 


Ala 








340 










Thr 


Leu 


Glu 


Leu 


Ara 


Val 


Ala 


Tvr 

x y X 






355 










360 


Glv 


Lvs 


Val 


Leu 


Ser 


Leu 


Pro 


Leu 




370 










375 




Ser 


Val 


His 


Glv 


Leu 


Pro 


Thr 


Pro 


385 










390 






Thr 


Pro 


Leu 


Glv 


Asp 


Glv 


Pro 


Met 










405 








Asp 


Ser 


Asn 


Glv 


Thr 


Tvr 

1 y J. 


Val 


Cys 








420 










Val 


Leu 


Ser 


Ara 


Thr 


Gin 


Asn 


Phe 






435 










440 


Glu 


Leu 


Lys 


Thr 


Ala 


Glu 


He 


Glu 




450 










455 




Glu 


Gly 


Asp 


Glu 


Val 


Thr 


Leu 


He 


465 










470 






Pro 


Lys 


Leu 


Ser 


Trp 


Ser 


Gin 


Leu 



Ala 


His 


Pro 


Asp 


Ala 


Gin 


Ala 


Glu 


25 










30 






Val 


Glu 


Val 


Met 


Ara 


Glv 


Lys 


Ser 










45 








Glv 

^-*-y 


Thr 


His 


Asp 


His 


Tvr 
X yx. 


Met 


Leu 








60 










Gly 


Ala 


Ara 


Pro 


Ara 


Leu 


Ala 


Ser 






75 










80 


Gin 


Val 


Thr 


Met 


His 


Asp 


Thr 


Arg 




90 










95 




ASD 


Ser 


Gin 


Glv 


Arg 


Leu 


Val 


Leu 


105 










110 






Ara 


Asp 


Tvr 

X y i 


Val 


Cys 


Val 


Val 


Arg 










125 








Ala 


Thr 


Ala 


Arg 


Leu 


Asn 


Val 


Phe 








140 










Ser 


Pro 


Asn 


Lys 


Glv 


Thr 


Leu 


Ser 






155 










1 fin 


He 


Ala 


Thr 


v^y o 






Arg 


/loll 




170 










1 7 s 




Tro 


Tvr 

X yi. 


Arg 


Asn 


Gly 


Gin 


Arg 


Leu 


185 










190 






Glu 


Glv 

\j^y 


Tvr 

* y i 


Met 


Thr 


Ser 


Arg 


Thr 










205 








Ser 


Leu 


Thr 


Ser 


Thr 


Leu 


Tvr 
X yx 


Leu 








220 








Ala 


Ser 


Phe 


His 


Cys 


Ala 


Ala 


His 






235 










240 


Gly 


Arg 


Leu 


Asp 


Ser 


Pro 


Thr 

X IIX 






250 














Glu 


His 


Val 


Gin 


Phe 


1 jrp 


Val 


\3xy 


265 










270 






Val 


Arg 


Glu 


Glv 


Asp 


Thr 


Val 


Gin 


















Pro 


Ser 


Pro 


Glu 


Tvr 


Thr 


Leu 


Phe 








300 










Val 


Leu 


Asn 


Val 


Asn 


Leu 


Glu 


ox y 






315 












Arg 


Gly 


Gin 


Ser 


Gly 


Thr 


T v/r 


wxy 




330 










335 




Ala 


ASD 


Asp 


Val 


Gin 


Leu 


Ser 


Lys 


345 










350 






Leu 


Asp 


Pro 


Leu 


Glu 


Leu 


Ser 


Glu 










365 








Asn 


Ser 


Ser 


Ala 


Val 


Val 


Asn 


P v^ 








380 










Ala 


Leu 


Arg 


Tro 


Thr 


Lys 


Asp 


Ser 






395 










400 


Leu 


Ser 


Leu 


Ser 


Ser 


He 


Thr 


Phe 




410 










415 




Glu 


Ala 


Ser 


Leu 


Pro 


Thr 


Val 


Pro 


425 










430 






Thr 


Leu 


Leu 


Val 


Gin 


Gly 


Ser 


Pro 










445 








Pro 


Lys 


Ala 


Asp 


Gly 


Ser 


Trp 


Arg 








460 










Cys 


Ser 


Ala 


Arg 


Gly 


His 


Pro 


Asp 






475 










480 


Gly 


Gly 


Ser 


Pro 


Ala 


Glu 


Pro 


He 
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485 






490 




4 95 








Gly Trp Val 


Ser 


Ser Ser 


Leu 


Thr Leu Lys Val 


Thr 




ou u 












510 




Ssr Ala 


LI OCX 


Arg Asp Gly 


J. 




ox u 


TVla Oav> 7V<^w« 

Mxa oer Asn Pro 


His 










O^U 












•iUyo 


Hi «: 


V a J. c lie 


nxs 


rne oxy 


Ala 
AX a 


vai oer Pro Gin 


Thr 








3 JO 








U 






Ala CI \/ 


Vax 


AXa vaX 


wet 


Axa vax 


HX a 


"\7-»1 Oj>^^ t7^1 r^T.. 

val Ser Val Gly 


Leu 








550 






R R ^ 

ODD 


560 




XjCU V cIJ. 


Val 


Ala Val 


Phe 


Tyr Cys 


VaX 


Arg Arg Lys Gly 


Gly 






565 






570 




575 






^^ys Hrg 


Gin 


Arg Arg 


Glu 


Lys Gly 


Ala 


Pro Pro Pro Gly 


Glu 




580 








585 




590 




Pro Gly 


Leu Ser 


His 


Ser Gly 


Ser 


Glu Gin 


Pro 


Glu Gin Thr Gly 


Leu 




595 






600 






605 




Leu Met 


Gly Gly 


Ala 


Ser Gly 


Gly Ala Arg 


Gly 


Gly Ser Gly Gly 


Phe 


610 






615 








620 




Gly Asp 


Glu Cys 
















625 



















<210> 109 

<211> 3825 

<212> DNA 

<213> Homo sapiens 

<400> 109 

ccaagttcta cctcatgttt ggaggatctt gctagctatg gccctcgtac tcggctccct 60 
gttgctgctg gggctgtgcg ggaactcctt ttcaggaggg cagccttcat ccacagatgc 120 
tcctaaggct tggaattatg aattgcctgc aacaaattat gagacccaag actcccataa 180 
agctggaccc attggcattc tctttgaact agtgcatatc tttctctatg tggtacagcc 240 
gcgtgatttc ccagaagata ctttgagaaa attcttacag aaggcatatg aatccaaaat 300 
tgattatgac aagccagaaa ctgtaatctt aggtctaaag attgtctact atgaagcagg 360 
gattattcta tgctgtgtcc tggggctgct gtttattatt ctgatgcctc tggtggggta 420 
tttcttttgt atgtgtcgtt gctgtaacaa atgtggtgga gaaatgcacc agcgacagaa 480 
ggaaaatggg cccttcctga ggaaatgctt tgcaatctcc ctgttggtga tttgtataat 540 
aataagcatt ggcatcttct atggttttgt ggcaaatcac caggtaagaa cccggatcaa 600 
aaggagtcgg aaactggcag atagcaattt caaggacttg cgaactctct tgaatgaaac 660 
tccagagcaa atcaaatata tattggccca gtacaacact accaaggaca aggcgttcac 720 
agatctgaac agtatcaatt cagtgctagg aggcggaatt cttgaccgac tgagacccaa 780 
catcatccct gttcttgatg agattaagtc catggcaaca gcgatcaagg agaccaaaga 840 
ggcgttggag aacatgaaca gcaccttgaa gagcttgcac caacaaagta cacagcttag 900 
cagcagtctg accagcgtga aaactagcct gcggtcatct ctcaatgacc ctctgtgctt 960 
ggtgcatcca tcaagtgaaa cctgcaacag catcagattg tctctaagcc agctgaatag 1020 
caaccctgaa ctgaggcagc ttccacccgt ggatgcagaa cttgacaacg ttaataacgt 1080 
tcttaggaca gatttggatg gcctggtcca acagggctat caatccctta atgatatacc 1140 
tgacagagta caacgccaaa ccacgactgt cgtagcaggt atcaaaaggg tcttgaattc 1200 
cattggttca gatatcgaca atgtaactca gcgtcttcct attcaggata tactctcagc 1260 
attctctgtt tatgttaata acactgaaag ttacatccac agaaatttac ctacattgga 1320 
agagtatgat tcatactggt ggctgggtgg cctggtcatc tgctctctgc tgaccctcat 1380 
cgtgattttt tactacctgg gcttactgtg tggcgtgtgc ggctatgaca ggcatgccac 14 4 0 
cccgaccacc cgaggctgtg tctccaacac cggaggcgtc ttcctcatgg ttggagttgg 1500 
attaagtttc ctcttttgct ggatattgat gatcattgtg gttcttacct ttgtctttgg 1560 
tgcaaatgtg gaaaaactga tctgtgaacc ttacacgagc aaggaattat tccgggtttt 1620 
ggatacaccc tacttactaa atgaagactg ggaatactat ctctctggga agctatttaa 1680 
taaatcaaaa atgaagctca cttttgaaca agtttacagt gactgcaaaa aaaatagagg 1740 
cacttacggc actcttcacc tgcagaacag cttcaatatc agtgaacatc tcaacattaa 1800 
tgagcatact ggaagcataa gcagtgaatt ggaaagtctg aaggtaaatc ttaatatctt 1860 
tctgttgggt gcagcaggaa gaaaaaacct tcaggatttt gctgcttgtg gaatagacag 1920 
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ctcaggcggg 
acaatcacgt 
tatttttgtt 
gtttgttccc 
acatattgcc 
gtgtacagta 
ttataggact 
aaataatcca 
ttcatctatt 
ccatcataga 
gagagagaaa 
ggtgtaccag 
cagcctaaca 
aagatccctg 
ggtccctaaa 
tggatctgtc 
tgatgtaatg 
gttctgattc 
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ccatgaaaaa 
acaatcctgt 
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ctttccaaat 
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ggtccatcac 
ttacactgag 
tttctatcaa 
tgtgctgaac 
ataccttttg 
tgagctaaat 
aagtgtgtga 
aaagattggc 
taactccatc 
tccaaaagtg 
gttttcctgt 
tccagaccta 
ttagatctgg 
agcacaaggt 
gtgatcagca 
tgtaggcatt 
aaaggaactt 



tggtaaatcc 
aaacagtttg 
aacaattcac 
cgtcaagata 
ttctctggat 
aactaagaag 
gttctctatc 
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aggaaaagct 
ctatcgtcga 
tatggaaaat 
tatgacaagc 
tcaaagtgga 
ccaggagcaa 
tggtctctgg 
tattcaagga 
tttctattta 
aactctatca 
acacaccagt 
ttaccacaga 
aagtcaccat 
aacctacagc 
catgttccac 
tgacaaattc 
gagttgtttt 
tgccattagg 
gtatcaagtg 
tgtcccagca 
aaatattcat 
atgaagaact 
ccccccaggt 
ggaaaacaaa 



cccgcaggag 
cccccaggaa 
cagcaacgag 
cttcaacgca 
tttgctcaga 
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agtgagaaag 
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ccatcacaac 
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gacactacaa 
aatgtgatgg 
ttacaggaaa 
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tgacttcttg 
atattcttca 
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aaaagagaga 
taaaccagat 
ataaacatta 
ttcctaaaat 
tggatgaaac 
ttgcttcagg 
ggtcggacaa 
cttttcatgt 
aaaaa 



tgaatctttt 
atttgaggaa 

tccttcctat 
cagggaatgg 
acttcatcac 
caataatagg 
tggcatcgtg 
gctacattat 
tacttccggc 
aggacgtgta 
gttatcataa 
attgatagct 
gatttttggt 
aacgtagtga 
aagaatgaac 
ccttcctgt'c 
catatggggt 
ctagattcta 
gatgcatttt 
caaatgagta 
gtgctgttga 
cgcagagatt 
catgcaaaag 
gagagatctt 
gcccaattac 
attggagtgc 
gaaatatgag 
ctttgagttt 
agtttcatgt 
aatttaacgt 
gcagattgca 



1980 
2040 

2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3825 



<210> 110 
<211> 865 
<212> PRT 

<213> Homo sapiens 
<400> 110 

Met Ala Leu Val Leu Gly Ser Leu Leu Leu Leu Gly Leu Cys Gly Asn 

15 10 15 

Ser Phe Ser Gly Gly Gin Pro Ser Ser Thr Asp Ala Pro Lys Ala Trp 

20 25 30 

Asn Tyr Glu Leu Pro Ala Thr Asn Tyr Glu Thr Gin Asp Ser His Lys 

35 40 45 

Ala Gly Pro lie Gly He Leu Phe Glu Leu Val His He Phe Leu Tyr 

50 55 60 

Val Val Gin Pro Arg Asp Phe Pro Glu Asp Thr Leu Arg Lys Phe Leu 
65 70 75 80 

Gin Lys Ala Tyr Glu Ser Lys He Asp Tyr Asp Lys Pro Glu Thr Val 

85 90 95 

He Leu Gly Leu Lys He Val Tyr Tyr Glu Ala Gly He He Leu Cys 

100 105 110 

Cys Val Leu Gly Leu Leu Phe He He Leu Met Pro Leu Val Gly Tyr 

115 120 125 

Phe Phe Cys Met Cys Arg Cys Cys Asn Lys Cys Gly Gly Glu Met His 

130 135 140 

Gin Arg Gin Lys Glu Asn Gly Pro Phe Leu Arg Lys Cys Phe Ala He 
145 150 155 160 



wo 03/000012 



138 



PCT/US02/19773 



ser 


Leu 


Leu 


Va 1 

vax 


X xe 


Cys 


T 1 o 

xxe 


X XC 


T 1 o 

xxe 


OCX 


Tip 
xxe 


oxy 


Tip 
xxe 


Php 

It lie 


iyr 


Gl V 
ox y 




















1 70 
1 / V 










1 7S 

1 » 3 






\7a 1 
V o Jl 


Ala 
AXo 


Asn 


Hie 
nx 0 


CZI n 
vsxn 


Va 1 


a 

r\x ^ 


Th-r 
1 llx 


/ix ^ 


Tip 
xxe 


xiy a 


Arrt 

nX ^ 


^pr 
oex 


a TTf 

nx ^ 


Lys 








xou 










1 

xpo 










1 QO 






Leu 


riXa 


Asp 


Ser 


a e n 




T \/c 


r\op 


Leu 


a i^n 
rix ^ 


Thr 

1 1 IX 


T.pn 
XiC u 


T .P1 1 
XlC u 


aQn 
nail 


Gl n 
ox u 


Thr 






1 y D 










9nn 










9ns 

^ U3 








Pro 


o J. u 


oxn 


± Xc; 


T \7 Q 

j_i y o 


iyr 


X X t; 


T .on 
XjC U 


AT a 


OXIl 


Tv/r 
1 yr 


Acn 


Thr 


Thr 

1 IIX 


T.VQ 

xjy a 


Asp 




^ i. u 










91 S 

^ X ^ 










220 










Lys 




rne 


inr 




Leu 


Asn 


OCX 


T 1 <a 

xxe 


a <sn 


^pr 

OCX 


Val 
V dx 


T .pn 


Gl V 
ox y 


Gl V 
oxy 


Gl V 
ox y 












^ V 










^ o o 










240 


T 1 a 


Leu 


Asp 


7\ y /"i 




a r*rt 
x^x ^ 


£ X W 




Tie* 
xxe 


Tip 
X xc 


Prrt 


Val 
vox 


ucu 


A^n 
na ^ 


Glu 


He 










94 R 
^ n J 










9*=i0 










255 




Lys 








Thr 


AT a 

AX CI 


X xc 


xjy a 


ox Li 


Thr 

1 1 IX 


xiy a 


ox u 


Ala 


TiPU 

XJC w. 


Glu 


Asn 






9 fin 

^ O V 










^ V <J 










270 






Met 


Asn 


Ser 


1 nr 


_ 

Leu 


Lys 


OCX 


Leu 


Hi <s 
nx a 


(^1 n 
oxn 


f^l n 
OX n 


OCX 


Thr 

X 1 IX 


Gl n 
ox 1 1 


XlC u 


<5pr 

OCX 
















9Rn 

/LOU 










9ft 








oer 


Ser 


Leu 


1 llX 


OC X 


Va 1 
V d X 


Lys 


i { IX 


OCX 


Leu 


a rrr 

r\JL ^ 


^pr 

OCX 


Qp r 

OCX 


Leu 


a cn 


a <5r» 














9QR 










3 V W 










Pro 


Leu 


v_.ys 


Leu 


Va 1 
vox 


Hi Q 
n X o 


irx 


OCX 


^^r 

OCX 


f;i n 

ox u 


Th r 

J. 1 1 X 


^y a 


Asn 


Ser 


He 


Arg 


"3 r» ci 










J XU 










3X3 










■^90 


Leu 




Leu 


OCX 


o X 1 1 




a Qn 


OCX 


Asn 


Prn 

t X Vi7 


fil n 
ox u 


Leu 


Arg 


Gin 


Leu 


Pro 




















?'^n 

o o v 










J ^ 




Pro 


Vox 


Asp 


a 1 a 


V3X li 


Leu 


Asp 


Asn 


Va 1 
V d X 


a cn 
nail 


rial 1 


Va 1 
vox 


Leu 


a rn 
nx 


Thr 

X liX 


na^ 


















3 4 3 










33 U 






Leu 


Asp 


biy 


Leu 


vax 


oxn 


vaXn 


n T7 

VaXy 


lyr 


oxn 


ocr 


Leu 


Asn 


Asp 


Tip 
xxe 


t X L7 






T Q 










1 fin 
3 ou 










303 








Asp 


Ar g 


vax 


oxn 


Arg 


oxn 


i nx 


Thy 
. 1 1 IX 


Th-r 

I nx 


Va 1 
V d X 


Va 1 
V d X 


ai a 

r\Xci 


r;i w 
oxy 


Tip 
xxe 


Lys 


nX y 




"7 n 
J / u 










J / 3 










30 V 










Vo± 


Leu 


Asn 


OCX 


X XC7 


vjxy 


OCX 


a cr^ 


Tip 
xxe 


a 

na^ 


Asn 


Val 


Thx 


Gin 


Arg 


Leu 






















3173 










AOd 

H \J\J 


Pro 




oxn 


Asp 


X X e 


Leu 


OCX 


ai a 

AXd 


Php 

It ilC 


9pr 

OCX 


Va 1 
V d X 


T\/r 

iyr 


Val 


Asn 


a «; n 
na i 1 


Thr 

J. IIX 










d n ^ 
^ uo 










4 1 n 

M X U 










4 1 S 
*J X 3 




PI 11 

<aXU 


Ser 


Tyr 


X Xc 


nxb 


Arg 


Asn 


Leu 


ZT X vJ 


Thr 

1 IIX 


Leu 


Gl n 
ox u 


Gl n 

ox LI 


Tvr 
iyr 


a Qn 
na^ 


Ser 








^ 9n 










4 9 R 
ft ^ 3 










A "^0 

1 3 W 






Tyr 


Trp 


Trp 


Leu 


oxy 


vjxy 


Leu 


Va 1 

vax 


T 1 o 

xxe 


Cys 


OCX 


Leu 


Leu 


Thr 

1 IIX 


Leu 


Tip 
X xc 
















/I / n 

fl fl u 










zl /I ^ 










T 1 


irne 


Tyr 


Tyr 


Leu 


oxy 


Leu 


Leu 


Cys 


ox y 


vax 


^y s 


GT \7 
oxy 


iyr 


Asp 














4 ^ 

*i 3 3 










d fiO 










ArQ 


tixs 


a 1 a 


1 ILL. 


IT i O 


Th-r 


THt- 

X ilX 


/IX ^ 


oxy 


v^y a 


Va 1 
V d X 


c;pr 

OCX 


a G rt 
nail 


Thr 

X IIX 


Gl V 
oxy 


Gly 












H f \J 










H f ^ 










480 


1 

Va X 




JucU 




Va 1 
V d X 


ox y 


Val 
V dx 


V3xy 


XjC Li 


Ser 


Phe 


XtC LI 


Phe 


Cys 


Trn 
i x^ 


He 










4 P R 
fl 0 0 




















*i Z/3 




Leu 




T 1 o 
X Xc 


X xc 


Va 1 
V dx 


Va 1 
V d X 


XlCU 


Thr 

X i 1 X 


Php 
r^i ic 


Va 1 
vox 


Php 


Gl V 

oxy 


Al a 

nX d 


Asn 


Val 


Glu 








D\J\J 










j\j -J 










510 






Lys 


Leu 


He 


Cys 


Glu 


Pro 


Tvr 
xyx 


Thr 


Ser 


Lys 


Glu 


Leu 


Phe 


Arg 


Val 


Leu 






J X J 










R9n 

«J ^ VJ 










=^9 S 








Asp 


Thr 


Pro 


i yr 


Leu 


Leu 


a c 1^ 


ox u 


a cr^ 
ria^ 


irp 


ox u 


Tv/r 

iyr 


T\7r 

iyr 


7 .p 1 1 

XjC L1 


OCX 


Gl V 
ox y 




530 










o «^ 










S4 n 










Lys 


Leu 








OCX 


xjy a 


1 4C Li 


uy a 


XlC u 


Thr 

X IIX 


Php 

ilC 


Glu 
ox u 


Gin 

OXli 


Val 

vox 


Tvr 


545 




















•J «J 3 










560 


Ser Asp 






Lys 




Arg 


Gly 


Thr 


Tvr 
lyx 


Gly 


Thr 


Leu 


His 


Leu 


Gin 




















570 










575 




Asn 


Ser 


Phe 


Asn 


He 


Ser 


Glu 


His 


Leu 


Asn 


He 


Asn 


Glu 


His 


Thr 


Gly 








580 










585 










590 






Ser 


He 


Ser 


Ser 


Glu 


Leu 


Glu 


Ser 


Leu 


Lys 


Val 


Asn 


Leu 


Asn 


He 


Phe 






595 










600 










605 








Leu 


Leu 


Gly 


Ala 


Ala 


Gly 


Arg 


Lys 


Asn 


Leu 


Gin 


Asp 


Phe 


Ala 


Ala 


Cys 




610 










615 










620 










Gly lie 


Asp 


Arg 


Met 


Asn 


Tyr 


Asp 


Ser 


Tyr 


Leu 


Ala 


Gin 


Thr 


Gly 


Lys 



wo 03/000012 



139 



PCT/US02/19773 









0 JU 






















S©r Pro 


Ax a 


\3±y vax 


Asn 


Leu 


Leu 


OC?X 


Ph(=k 

C It's 


Al a 


Tyr 


Asp 


Leu 


/^l n 
OXU 


Ala 
AX a 
















650 














Xj]^S AJ.3. 


Asn 


Ser Leu 


rx O 


trx o 


V3xy 






Am 
rtx y 


Asn 


OCX 


Leu 


Lys 


rtx y 






























-ASp n.±a. 


m n 


1 nx lie 


Lys 


inx 


X xcs 


nx o 


f^l n 

UXll 




Arg 


vax 


Leu 


Pro 


T 1 o 
1 Xc 


D / 0 








DO U 


















^2^ n m n 


OCX 


Xieii otsx 


Thr- 

1 ilX 


XlCU 


TvT 
1 yx 


Gin 


^pr 


Val 


Lys 


T 1 e» 

X xc 


XiCU 


^^1 n 

V3X 1 1 


A y c\ 
nx y 


oy u 








COR 










7nn 












Asn 


_ 

Gly Leu 


Leu 




nx ^ 


vax 


Thr 


A r 
nX 


Tic* 
± Xc 


Leu 


Ala 
AXa 


OcX 


Leu 


the; 






7 1 n 










71 R 

/ X O 










7 9n 








rne 


Tin 

JL xe 


Th T- 

1 IIX 


Asn 


A o 


Thr 


oex 


OCX 




T 1 o 
X Xc 


1 Xc 
















7^0 










7 S 




oXU OJ.U 


1 nx 


Tito T^/o 

jjys ijys 


Tyr 


oxy 


riX ^ 


1 XIX 


X X c 


T 1 P 

X X 


uxy 


lyr 


Php 
It lie 


f;i n 

^X u 


nxo 






1 Ad 








7^ R 
















I yr IjSU 


p 1 1-» 
bxn 


Trp He 


IjXU 


f ne 


C O T- 

oex 


X xe 


OCX 


f^l 11 

oXU 


Lys 


1 

vax 


Ala 
AXa 


Ser 


^ys 




7 c; 








7 fin 


















Lys Pro 


V d X 


Ala Thr 


Ala 


Leu 




Thr 


Al a • 
r^xci 


Val 


A cjn 


Va 1 
V ax 


Ph<:k 

JTllC 


XlC u 


v^y o 


1 1 \j 








77 










7ftn 

/ 0 U 










ber lyr 


lie 


He Asp 


Pro 


Leu 




Leu 


Ph^ 

It J Ic 


Trr» 

irp 


Php 

t lie 


f^l \/ 
ox y 


T 1 <a 
X Xc 


ox y 


Lys 


"7 Q e; 






790 










7 










ft no 

0 vj W 


/^X O J- i iX 


Val 


Phe Leu 


Leu 


Pro 


Ala 


Leu 


lie 


Phe 


Ala 


Val 


Lys 


Leu 


Ala 






805 










810 










815 




Lys Tyr 


Tyr 


Arg Arg Met 


Asp 


Ser 


Glu 


Asp 


Val 


Tyr 


Asp 


Asp 


Val 


Glu 






820 








825 










830 






Thr He 


Pro 


Met Lys 


Asn 


Met 


Glu 


Asn 


Gly 


Asn 


Asn 


Gly 


Tyr 


His 


Lys 




835 








840 










845 








Asp His 


Val 


Tyr Gly 


He 


His 


Asn 


Pro 


Val 


Met 


Thr 


Ser 


Pro 


Ser 


Gin 



850 855 860 



His 
865 



<210> 111 
<211> 673 
<212> DNA 
<213> Homo sapiens 

<400> 111 

ttgaattcgc caaggctggg tttccctcat 
tcttctcctt ggcatacagc tcacagctct 
ctcccgggtg ctggaggctg ttaatgggac 
ctttgcccct gtgggtgatg ctctaacagt 
acctgagcag tttgtattct actaccacat 
taaggaccgg gtgtcttggg atgggaatcc 
gaaactgcag ttcgacgaca atgggacata 
tgatggggtg ataggggaga tccggctcag 
ccacttcctg gctctggcca ttggctctgc 
agtggtcctc ttccagcatt accggaaaaa 
ggagataaaa tcaaaagaag aggaaaggct 
agaagacaca gac 

<210> 112 

<211> 215 

<212> PRT 

<213> Homo sapiens 



gtatggcaag agctctactc gtgcggtgct 60 
ttggcctata gcagctgtgg aaatttatac 120 
agatgctcgg ttaaaatgca ctttctccag 180 
gacctggaat tttcgtcctc tagacggggg 240 
agatcccttc caacccatga gtgggcggtt 300 
tgagcggtac gatgcctcca tccttctctg 360 
cacctgccag gtgaagaacc cacctgatgt 420 
cgtcgtgcac actgtacgct tctctgagat 480 
ctgtgcactg atgatcataa tagtaattgt 540 
gcgatgggcc gaaagagctc ataaagtggt 600 
caaccaagag aaaaaggtct ctgtttattt 660 

673 



<400> 112 

Met Tyr Gly Lys Ser Ser Thr Arg Ala Val Leu 



Leu Leu Leu Gly He 
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